
Stefano Ermon
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
The Race to Production-Grade Diffusion LLMs with Stefano Ermon
- Published
- March 26, 2026
- Duration
- 1h 3m
- Summary source
- description
- Last updated
- Jul 5, 2026
Discusses llm.
Summary
Today, we're joined by Stefano Ermon, associate professor at Stanford University and CEO of Inception Labs to discuss diffusion language models. We dig into how diffusion approaches—traditionally used for images—are being adapted for text and code generation, the technical challenges of applying continuous methods to discrete token spaces, and how diffusi…
Intelligent Report
Sign in to read teasers, or upgrade to Research Pro to commission intelligent report for this episode. Learn more →
Show notes
Today, we're joined by Stefano Ermon, associate professor at Stanford University and CEO of Inception Labs to discuss diffusion language models. We dig into how diffusion approaches—traditionally used for images—are being adapted for text and code generation, the technical challenges of applying continuous methods to discrete token spaces, and how diffusion models compare to traditional autoregressive LLMs. Stefano introduces Mercury 2, a commercial-scale diffusion LLM that can generate multiple
Themes
- llm