Cover art for The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Niklas Muennighoff

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Inside s1: An o1-Style Reasoning Model That Cost Under $50 to Train with Niklas Muennighoff

Published
March 3, 2025
Duration
49:29
Summary source
description
Last updated
Jun 7, 2026

Discusses openai, google-ai.

Summary

Today, we're joined by Niklas Muennighoff, a PhD student at Stanford University, to discuss his paper, “S1: Simple Test-Time Scaling.” We explore the motivations behind S1, as well as how it compares to OpenAI's O1 and DeepSeek's R1 models. We dig into the different approaches to test-time scaling, including parallel and sequential scaling, as well as S1’…

Intelligent report

Sign in to read teasers, or upgrade to Research Pro to commission a new dossier for this episode. Learn more →

Show notes

Today, we're joined by Niklas Muennighoff, a PhD student at Stanford University, to discuss his paper, “S1: Simple Test-Time Scaling.” We explore the motivations behind S1, as well as how it compares to OpenAI's O1 and DeepSeek's R1 models. We dig into the different approaches to test-time scaling, including parallel and sequential scaling, as well as S1’s data curation process, its training recipe, and its use of model distillation from Google Gemini and DeepSeek R1. We explore the novel "budge

Themes

  • openai
  • google-ai
Niklas Muennighoff: Inside s1: An o1-Style Reasoning Model That Cost Under $50 to Train with Niklas Muennighoff | The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) | Vagelintel