News
How Transparent Is Diffusion Gemma (and why it matters) " Less Wrong
2+ hour, 18+ min ago (20+ words) Authors: Joshua Engels*, Callum Mc Dougall*, Bilal Chughtai*, Janos Kramar, Senthoran Rajamanoharan, Cindy Wu, Arthur Conmy, Asic Q Chen, Jean Tarbour...
dialog with Gemini: would indexing have rescued most victims of 2008 subprime crisis? " Less Wrong
1+ hour, 22+ min ago (877+ words) I have been fairly certain that conversion of the sub-prime mortgages in the 2008 crisis to indexed mortgages for their remaining balances would have largely made them viable. Is this accurate? Yes, your assessment is analytically sound. Converting the remaining balances…...
Against Planet-Eating Nanoreplicators " Less Wrong
1+ hour, 56+ min ago (411+ words) A classic trope of hard sci-fi as well as more serious futurism is using self-replicating nanoassemblers to convert planets of the Solar System to computronium, or some other kind of a Dyson swarm. This is almost the default way to…...
[Linkpost] How Transparent Is Diffusion Gemma (and why it matters) " Less Wrong
2+ hour, 18+ min ago (20+ words) Work also done with Cindy Wu, Asic Q Chen, Jean Tarbouriech, Min Ma, Brendan O'Donoghue, Jo'o Gabriel Lopes de Oliveira. "...
The Invisible Side of AI Governance " Less Wrong
3+ hour, 29+ min ago (1562+ words) Tldr: Most strategic writing on AI governance on Less Wrong describes the outsider game, which is most often visible: press, statements, open letters. Here I want to describe the other, invisible half: the insider work within ministerial cabinets and international…...
Would anybody here be interested in a "mistake postmortem" discussion group? " Less Wrong
10+ hour, 20+ min ago (232+ words) I recently made a dumb (in retrospect) mistake that set me back a lot. Feeling upset and regretful, I spoke to an older family member who reassured me, "yeah, unfortunately there's no way around it; we have to experience these…...
Thoughts on Likelihood of Existential Risks by Misaligned AIs " Less Wrong
23+ hour, 31+ min ago (304+ words) The implication of this is that it is very hard to have one concrete AI risk argument I can read and respond to. It is difficult to form opinions on AI safety when most experts are in great disagreement about…...
How I think developers of frontier AI systems and regulators ought to act in the face of existential AI risk " Less Wrong
1+ day, 1+ min ago (1088+ words) In a recent podcast episode published July 20, 2025, Anthropic co-founder Ben Mann is asked (at 48: 43) "What are the odds that we align AI correctly and actually solve this problem?" In his answer, Ben references the following part of Anthropic's March 8, 2023 blog…...
Why should AI be moral? " Less Wrong
23+ hour, 39+ min ago (1067+ words) In outline, the moral skeptic's challenge goes: To respond, one must either refute the skeptical hypothesis or identify an extra-moral reason to accept morality. Without a response, one's acceptance of morality is unjustified. This position threatens to be reflectively destabilizing…...
World-modeling the US vs. Anthropic Standoff on Claude Fable " Less Wrong
1+ day, 2+ hour ago (872+ words) I spent the last two days doing a deep dive in forecasting outcomes of the US forcing Anthropic to take down Claude Fable. I did this for two reasons...