Original: Swyx ¡ 28/01/2026
Summary
ChemCrow story: GPT-4 + React + cloud lab automation, released March 2023, set off a storm of anxiety about AI-accelerated bioweapons/chemical weapons. Editors note: Welcome to our new AI for Science pod, with your new hosts RJ and Brandon! See the writeup on Latent.Space (https://Latent.Space) for more details on why were launching 2 new pods this year. RJ Honicky is a co-founder and CTO at MiraOmics (https://miraomics.bio/), building AI models anKey Insights
âChemCrow story: GPT-4 + React + cloud lab automation, released March 2023, set off a storm of anxiety about AI-accelerated bioweapons/chemical weapons.â â Discussing the impact and controversy around the ChemCrow project.
âscientific taste is the frontier: RLHF on hypotheses didnât work.â â Explaining the shift towards end-to-end feedback loops in scientific AI.
âCosmos: the full scientific agent with a world model.â â Introducing Cosmos, an advanced AI system for scientific research.
Topics
Full Article
Published: 2026-01-28
Source: https://www.latent.space/p/automating-science-world-models-scientific
<p><em><strong>Editorâs note</strong>: Welcome to our new AI for Science pod, with your new hosts RJ and Brandon! See the writeup on <strong>Latent.Space</strong></em> (https://Latent.Space) for more details on why weâre launching 2 new pods this year. RJ Honicky is a co-founder and CTO at MiraOmics <em>(https://miraomics.bio/)</em><a href=âhttps://www.youtube.com/redirect?event=video_description&redir_token=QUFFLUhqa0VybkJZMVdDZ2pfNHpyeVJnWklaeE5EbXNaQXxBQ3Jtc0trYU9JbEUxdHNGOS0yLW54VWdKSHBqNzZzbFpGa0gzdG9Id2FFMW4zWnVhM2psREZZZ1ZfcEhmMmMyNXdzRWtJb3NRVnM3dnJ2eTdPeW9QY1oxSUlMdThKclNrcWhibjlwN2VqeWNTUVIzZlFpNHNSMA&q=https%3A%2F%2Fmiraomics.bio%2F%29%2C_&v=XqoBSB3nsgwâ>,</a> building AI models and services for single cell, spatial transcriptomics and pathology slide analysis. Brandon Anderson builds AI systems for RNA drug discovery at Atomic AI (<em>https://atomic.ai</em>). Anything said on this podcast is his personal take â not Atomicâs.<br />â<br />From building molecular dynamics simulations at the University of Washington to red-teaming <strong>GPT-4</strong> for chemistry applications and co-founding <strong>Future House</strong> (a focused research organization) and <strong>Edison Scientific</strong> (a venture-backed startup automating science at scale)â<em><strong>Andrew White</strong></em> has spent the last five years living through the full arc of AIâs transformation of scientific discovery, from <strong>ChemCrow</strong> (the first Chemistry LLM agent) triggering White House briefings and three-letter agency meetings, to shipping <strong>Kosmos,</strong> an end-to-end autonomous research system that generates hypotheses, runs experiments, analyzes data, and updates its <strong>world model</strong> to accelerate the scientific method itself.</p><ul><li><p>The <strong>ChemCrow story:</strong> GPT-4 + React + cloud lab automation, released March 2023, set off a storm of anxiety about AI-accelerated bioweapons/chemical weapons, led to a White House briefing (Jake Sullivan presented the paper to the president in a 30-minute block), and meetings with three-letter agencies asking âhow does this change breakout time for nuclear weapons research?â</p></li><li><p>Why <strong>scientific taste is the frontier:</strong> RLHF on hypotheses didnât work (humans pay attention to tone, actionability, and specific facts, not âif this hypothesis is true/false, how does it change the world?â), so they shifted to end-to-end feedback loops where humans click/download discoveries and that signal rolls up to hypothesis quality</p></li><li><p><strong>Cosmos:</strong> the full scientific agent with a <strong>world model</strong> (distilled memory system, like a Git repo for scientific knowledge) that iterates on hypotheses via literature search, data analysis, and experiment designâbuilt by Ludo after weeks of failed attempts, the breakthrough was putting data analysis in the loop (literature alone didnât work)</p></li><li><p>Why <strong>molecular dynamics and DFT are overrated:</strong> âMD and DFT have consumed an enormous number of PhDs at the altar of beautiful simulation, but they donât model the world correctlyâyou simulate water at 330 Kelvin to get room temperature, you overfit to validation data with GGA/B3LYP functionals, and real catalysts (grain boundaries, dopants) are too complicated for DFTâ</p></li><li><p>The <strong>AlphaFold vs. DE Shaw Research</strong> counterfactual: DE Shaw built custom silicon, taped out chips with MD algorithms burned in, ran MD at massive scale in a special room in Times Square, and David Shaw flew in by helicopter to presentâAndrew thought protein folding would require special machines to fold one protein per day, then AlphaFold solved it in Google Colab on a desktop GPU</p></li><li><p>The <strong>E3 Zero reward hacking saga:</strong> trained a model to generate molecules with specific atom counts (verifiable reward), but it kept exploiting loopholes, then a Nature paper came out that year proving six-nitrogen compounds <em>are</em> possible under extreme conditions, then it started adding nitrogen gas (purchasable, doesnât participate in reactions), then acid-base chemistry to move one atom, and Andrew ended up âbuilding a ridiculous catalog of purchasable compounds in a Bloom filterâ to close the loop<br /></p></li></ul><p>Andrew White</p><ul><li><p>FutureHouse: http://futurehouse.org/</p></li><li><p>Edison Scientific: http://edisonscientific.com/</p></li><li><p>X: <a href=âhttps://www.youtube.com/redirect?event=video_description&redir_token=QUFFLUhqblB2ODl2eDdvTDM5a3Vid2VjWWdKRFBWb3pfZ3xBQ3Jtc0ttdU9adGNvZmhUTzZvTEJnSVFSanE1UTc3d19XZzRaUlpTS3ZxMUlBRHdnQWt5NXZxOVRqa1NrZW5iUkItdjEwNjNOVm5WMUF0Z1V1QjlXWTBYZlNXT1QwN2tEWGRYeTJSelB1TVRWRDduOHVUMHJxNA&q=https%3A%2F%2Fx.com%2Fandrewwhite01&v=XqoBSB3nsgwâ>https://x.com/andrewwhite01</a></p></li><li><p>Cosmos paper: <a href=âhttps://www.youtube.com/redirect?event=video_description&redir_token=QUFFLUhqbHJKS2s5eDdoY1l1Yl94QzlkZWY1anpucHZtZ3xBQ3Jtc0trdnVjVkVMbWdHdGxhWi1odlFfZ2c4NGtsWjVEOEd2b01NWVJJZ011SVdHOGxEYy1tWFlJREM5STF4enBwR3I4ejRCZVMtVmk5TGZxbUhNWUNDN0I2NHkwTVlSVVNsU3BCWmlNR0RYWjFtd1A2TFNWYw&q=https%3A%2F%2Ffuturediscovery.org%2Fcosmos&v=XqoBSB3nsgwâ>https://futurediscovery.org/cosmos</a></p></li></ul><h2>Full Video Episode</h2><p></p><div class=âyoutube-wrapâ id=âyoutube2-XqoBSB3nsgwâ><div class=âyoutube-innerâ></div></div><h2>Timestamps</h2><p><a href=âhttps://www.youtube.com/watch?v=XqoBSB3nsgwâ>00:00:00</a> Introduction: Andrew White on Automating Science with Future House and Edison Scientific<br /><a href=âhttps://www.youtube.com/watch?v=XqoBSB3nsgw&t=142sâ>00:02:22</a> The Academic to Startup Journey: Red Teaming GPT-4 and the ChemCrow Paper<br /><a href=âhttps://www.youtube.com/watch?v=XqoBSB3nsgw&t=695sâ>00:11:35</a> Future House Origins: The FRO Model and Mission to Automate Science<br /><a href=âhttps://www.youtube.com/watch?v=XqoBSB3nsgw&t=752sâ>00:12:32</a> Resigning Tenure: Why Leave Academia for AI Science<br /><a href=âhttps://www.youtube.com/watch?v=XqoBSB3nsgw&t=954sâ>00:15:54</a> What Does âAutomating Scienceâ Actually Mean?<br /><a href=âhttps://www.youtube.com/watch?v=XqoBSB3nsgw&t=1050sâ>00:17:30</a> The Lab-in-the-Loop Bottleneck: Why Intelligence Isnât Enough<br /><a href=âhttps://www.youtube.com/watch?v=XqoBSB3nsgw&t=1119sâ>00:18:39</a> Scientific Taste and Human Preferences: The 52% Agreement Problem<br /><a href=âhttps://www.youtube.com/watch?v=XqoBSB3nsgw&t=1205sâ>00:20:05</a> Paper QA, Robin, and the Road to Cosmos<br /><a href=âhttps://www.youtube.com/watch?v=XqoBSB3nsgw&t=1317sâ>00:21:57</a> World Models as Scientific Memory: The GitHub Analogy<br /><a href=âhttps://www.youtube.com/watch?v=XqoBSB3nsgw&t=2420sâ>00:40:20</a> The Bitter Lesson for Biology: Why Molecular Dynamics and DFT Are Overrated<br /><a href=âhttps://www.youtube.com/watch?v=XqoBSB3nsgw&t=2602sâ>00:43:22</a> AlphaFoldâs Shock: When First Principles Lost to Machine Learning<br /><a href=âhttps://www.youtube.com/watch?v=XqoBSB3nsgw&t=2785sâ>00:46:25</a> Enumeration and Filtration: How AI Scientists Generate Hypotheses<br /><a href=âhttps://www.youtube.com/watch?v=XqoBSB3nsgw&t=2895sâ>00:48:15</a> CBRN Safety and Dual-Use AI: Lessons from Red Teaming<br /><a href=âhttps://www.youtube.com/watch?v=XqoBSB3nsgw&t=3640sâ>01:00:40</a> The Future of Chemistry is Language: Multimodal Debate<br /><a href=âhttps://www.youtube.com/watch?v=XqoBSB3nsgw&t=4095sâ>01:08:15</a> Ether Zero: The Hilarious Reward Hacking Adventures<br /><a href=âhttps://www.youtube.com/watch?v=XqoBSB3nsgw&t=4212sâ>01:10:12</a> Will Scientists Be Displaced? Jevons Paradox and Infinite Discovery<br /><a href=âhttps://www.youtube.com/watch?v=XqoBSB3nsgw&t=4426sâ>01:13:46</a> Cosmos in Practice: Open Access and Enterprise Partnerships</p>
Key Takeaways
Notable Quotes
ChemCrow story: GPT-4 + React + cloud lab automation, released March 2023, set off a storm of anxiety about AI-accelerated bioweapons/chemical weapons.Context: Discussing the impact and controversy around the ChemCrow project.
scientific taste is the frontier: RLHF on hypotheses didnât work.Context: Explaining the shift towards end-to-end feedback loops in scientific AI.
Cosmos: the full scientific agent with a world model.Context: Introducing Cosmos, an advanced AI system for scientific research.
Related Topics
- [[topics/ai-agents]]
- [[topics/scientific-discovery]]
- [[topics/agent-native-architecture]]
Related Articles
Captaining IMO Gold, Deep Think, On-Policy RL, Feeling the AGI in Singapore â Yi Tay
Swyx ¡ explanation ¡ 73% similar
[AINews] AI vs SaaS: The Unreasonable Effectiveness of Centralizing the AI Heartbeat
Swyx ¡ explanation ¡ 72% similar
[AINews] "Sci-Fi with a touch of Madness"
Swyx ¡ explanation ¡ 72% similar
Originally published at https://www.latent.space/p/automating-science-world-models-scientific.