Results for "reasoning model"
8 results
Episodes
StandardSummaries onlyTeaching LLMs to Self-Reflect with Reinforcement Learning with Maohao Shen
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)· Maohao Shen· Apr 8, 2025
Today, we're joined by Maohao Shen, PhD student at MIT to discuss his paper, “Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search.” We dig into how Satori leverage…
llm
Intelligent report#490 – State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI
Lex Fridman Podcast· Feb 1, 2026
Sebastian Raschka and Nathan Lambert break down the AI landscape with Lex Friedman, covering open-weight models, Chinese lab competition, coding tools, and why transformer architectures haven't fundamentally changed desp…
llmmachine-learningsafety-alignmentsociety
StandardSummaries onlyInside s1: An o1-Style Reasoning Model That Cost Under $50 to Train with Niklas Muennighoff
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)· Niklas Muennighoff· Mar 3, 2025
Today, we're joined by Niklas Muennighoff, a PhD student at Stanford University, to discuss his paper, “S1: Simple Test-Time Scaling.” We explore the motivations behind S1, as well as how it compares to OpenAI's O1 and D…
openaigoogle-ai
StandardSummaries onlyAI Trends 2026: OpenClaw Agents, Reasoning LLMs, and More with Sebastian Raschka
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)· Sebastian Raschka· Feb 26, 2026
In this episode, Sebastian Raschka, independent LLM researcher and author, joins us to break down how the LLM landscape has changed over the past year and what is likely to matter most in 2026. We discuss the shift from …
llminference
StandardSummaries onlyScaling Up Test-Time Compute with Latent Reasoning with Jonas Geiping
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)· Jonas Geiping· Mar 17, 2025
Today, we're joined by Jonas Geiping, research group leader at Ellis Institute and the Max Planck Institute for Intelligent Systems to discuss his recent paper, “Scaling up Test-Time Compute with Latent Reasoning: A Recu…
StandardSummaries only[State of RL/Reasoning] IMO/IOI Gold, OpenAI o3/GPT-5, and Cursor Composer — Ashvin Nair, Cursor
Latent Space: The AI Engineer Podcast· Dec 30, 2025
From Berkeley robotics and OpenAI's 2017 Dota-era internship to shipping RL breakthroughs on GPT-4o, o1, and o3, and now leading model development at Cursor, Ashvin Nair has done it all. We caught up with Ashvin at NeurI…
openai
StandardSummaries onlyReality: The Final Eval — Lukas Petersson and Axel Backlund of Andon Labs
Latent Space: The AI Engineer Podcast· Jun 4, 2026
The new AIEWF website is live! Get your tickets booked ASAP as they -will- sell out. Take the AI Engineering Survey and get >$2k in credits and free AIE WF tickets!Most industry benchmarks compress intelligence and reaso…
evals