Results for "reasoning model"

8 results

Episodes

  • The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
    Standard · Summaries only

    Teaching LLMs to Self-Reflect with Reinforcement Learning with Maohao Shen

    The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)· Maohao Shen· Apr 8, 2025

    Today, we're joined by Maohao Shen, PhD student at MIT, to discuss his paper, “Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search.” We dig into how Satori leverage

    llm
  • Lex Fridman Podcast
    Intelligent report

    #490 – State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI

    Lex Fridman Podcast· Feb 1, 2026

    Sebastian Raschka and Nathan Lambert break down the AI landscape with Lex Fridman, covering open-weight models, Chinese lab competition, coding tools, and why transformer architectures haven't fundamentally changed desp

    llm machine-learning safety-alignment society
  • Tech Brew Ride Home
    Intelligent report

    Microsoft Build

    Tech Brew Ride Home· Jun 3, 2026

    Microsoft Build 2026 dominated by agent-first computing: Scout for Teams, the RTX Spark Dev Box, seven new AI models, Project Solara hardware platform, and a redesigned quantum chip targeting 2029.

    anthropic agents
  • The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
    Standard · Summaries only

    Inside s1: An o1-Style Reasoning Model That Cost Under $50 to Train with Niklas Muennighoff

    The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)· Niklas Muennighoff· Mar 3, 2025

    Today, we're joined by Niklas Muennighoff, a PhD student at Stanford University, to discuss his paper, “s1: Simple Test-Time Scaling.” We explore the motivations behind s1, as well as how it compares to OpenAI's o1 and D

    openai google-ai
  • The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
    Standard · Summaries only

    AI Trends 2026: OpenClaw Agents, Reasoning LLMs, and More with Sebastian Raschka

    The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)· Sebastian Raschka· Feb 26, 2026

    In this episode, Sebastian Raschka, independent LLM researcher and author, joins us to break down how the LLM landscape has changed over the past year and what is likely to matter most in 2026. We discuss the shift from

    llm inference
  • The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
    Standard · Summaries only

    Scaling Up Test-Time Compute with Latent Reasoning with Jonas Geiping

    The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)· Jonas Geiping· Mar 17, 2025

    Today, we're joined by Jonas Geiping, research group leader at the ELLIS Institute and the Max Planck Institute for Intelligent Systems, to discuss his recent paper, “Scaling up Test-Time Compute with Latent Reasoning: A Recu

  • Latent Space: The AI Engineer Podcast
    Standard · Summaries only

    [State of RL/Reasoning] IMO/IOI Gold, OpenAI o3/GPT-5, and Cursor Composer — Ashvin Nair, Cursor

    Latent Space: The AI Engineer Podcast· Dec 30, 2025

    From Berkeley robotics and OpenAI's 2017 Dota-era internship to shipping RL breakthroughs on GPT-4o, o1, and o3, and now leading model development at Cursor, Ashvin Nair has done it all. We caught up with Ashvin at NeurI

    openai
  • Latent Space: The AI Engineer Podcast
    Standard · Summaries only

    Reality: The Final Eval — Lukas Petersson and Axel Backlund of Andon Labs

    Latent Space: The AI Engineer Podcast· Jun 4, 2026

    The new AIEWF website is live! Get your tickets booked ASAP as they -will- sell out. Take the AI Engineering Survey and get >$2k in credits and free AIE WF tickets! Most industry benchmarks compress intelligence and reaso

    evals