Results for "AI inference"

Keyword scan across titles, descriptions, summaries, and tags. For interview listings, try Guest appearances.

15 results

Episodes

  • The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
    StandardSummaries only

    Dataflow Computing for AI Inference with Kunle Olukotun

    The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)· Kunle Olukotun· Oct 14, 2025

    In this episode, we're joined by Kunle Olukotun, professor of electrical engineering and computer science at Stanford University and co-founder and chief technologist at Sambanova Systems, to discuss reconfigurable dataf

    inference
  • Squawk on the Street
    Intelligent report

    11AM Hour: Vista Equity Partners CEO Robert Smith, Honeywell Aerospace CEO Ahead of Spinoff & Medtronic CEO on Earnings 6/3/26

    Squawk on the Street· Jun 4, 2026

    M&A veteran Paul Taubman tempers deal-making optimism, Vista's Robert Smith unveils low-cost AI inference infrastructure, and Space X targets a record-shattering $75B IPO as markets navigate Iran tensions and alt-manager

    inferenceinvesting
  • The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
    StandardSummaries only

    Scaling Agentic Inference Across Heterogeneous Compute with Zain Asgar

    The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)· Zain Asgar· Dec 2, 2025

    In this episode, Zain Asgar, co-founder and CEO of Gimlet Labs, joins us to discuss the heterogeneous AI inference across diverse hardware. Zain argues that the current industry standard of running all AI workloads on hi

    llmagentsinference
  • The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
    StandardSummaries only

    Multimodal AI Models on Apple Silicon with MLX with Prince Canuma

    The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)· Prince Canuma· Aug 26, 2025

    Today, we're joined by Prince Canuma, an ML engineer and open-source developer focused on optimizing AI inference on Apple Silicon devices. Prince shares his journey to becoming one of the most prolific contributors to A

    multimodalinference
  • The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
    StandardSummaries only

    How to Engineer AI Inference Systems with Philip Kiely

    The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)· Philip Kiely· Apr 30, 2026

    In this episode, Philip Kiely, head of AI education at Baseten, joins us to unpack the fast-evolving discipline of inference engineering. We explore why inference has become the stickiest and most critical workload in AI

    inference
  • The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
    StandardSummaries only

    Closing the Loop Between AI Training and Inference with Lin Qiao

    The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)· Lin Qiao· Aug 12, 2025

    In this episode, we're joined by Lin Qiao, CEO and co-founder of Fireworks AI. Drawing on key lessons from her time building PyTorch, Lin shares her perspective on the modern generative AI development lifecycle. She expl

    pytorchinferencegenerative-ai
  • The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
    StandardSummaries only

    Accelerating AI Training and Inference with AWS Trainium2 with Ron Diamant

    The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)· Ron Diamant· Feb 24, 2025

    Today, we're joined by Ron Diamant, chief architect for Trainium at Amazon Web Services, to discuss hardware acceleration for generative AI and the design and role of the recently released Trainium2 chip. We explore the

    inferencegenerative-ai
  • Tech Brew Ride Home
    Intelligent report

    Interviewing For A Job At Anthropic? DON’T Use AI.

    Tech Brew Ride Home· Jun 1, 2026

    Nvidia unveils ARM-based RTX Spark chips and a trillion-parameter desktop supercomputer, Minimax launches a coding model at 1/40th Anthropic's price, and Anthropic bans AI in its own hiring process.

    anthropic
  • All-In with Chamath, Jason, Sacks & Friedberg
    StandardSummaries only

    Jensen Huang LIVE: Nvidia's Future, Physical AI, Rise of the Agent, Inference Explosion, AI PR Crisis

    All-In with Chamath, Jason, Sacks & Friedberg· Mar 19, 2026

    (0:00) Jensen Huang joins the show! (0:26) Acquiring Groq and the inference explosion (8:53) Decision making at the world's most valuable company (10:47) Physical AI's $50T market, OpenClaw's future, the new operating sy

    anthropicagentsinference
  • The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
    StandardSummaries only

    High-Efficiency Diffusion Models for On-Device Image Generation and Editing with Hung Bui

    The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)· Hung Bui· Oct 28, 2025

    In this episode, Hung Bui, Technology Vice President at Qualcomm, joins us to explore the latest high-efficiency techniques for running generative AI, particularly diffusion models, on-device. We dive deep into the techn

    inferencegenerative-ai
  • Latent Space: The AI Engineer Podcast
    StandardSummaries only

    NVIDIA's AI Engineers: Agent Inference at Planetary Scale and "Speed of Light" — Nader Khalil (Brev), Kyle Kranen (Dynamo)

    Latent Space: The AI Engineer Podcast· Mar 10, 2026

    Join Kyle, Nader, Vibhu, and swyx live at NVIDIA GTC next week!Now that AIE Europe tix are ~sold out, our attention turns to Miami and World’s Fair!The definitive AI Accelerator chip company has more than 10xed this AI S

    inference
  • The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch
    StandardSummaries only

    20VC: Brex Acquired for $5.15BN | a16z Companies are 2/3 AI Revenues | Anthropic Inference Costs Skyrocket | OpenEvidence Raises at $12BN Valuation | The IPO Market: EquipmentShare, Wealthfront and Ethos Insurance

    The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch· Jan 29, 2026

    AGENDA: 03:36 Brex Acquisition by Capital One for $5.15BN 10:54 Does Brex's Acquisition Help or Hurt Ramp? 16:28 TikTok Deal Completed: Who Won & Who Lost: Analysis 19:30 Anthropic Inference Costs Higher Than Expected 37

    anthropicinferenceinvesting
  • Tech Brew Ride Home
    StandardSummaries only

    The AI Sec-Pocalypse Is Actually Nigh?

    Tech Brew Ride Home· May 11, 2026

    Google confirms AI-assisted zero-day exploit, OpenAI launches $4B+ deployment company, Apple refines Liquid Glass, TikTok goes ad-free in UK, and why agentic AI will reshape compute infrastructure.

    openaiagentsinference
  • SemiWiki.com
    Intelligent report

    Podcast EP339: Unique Scalable, Power-Efficient AI Technology from EdgeCortix with Dr. Sakya Dasgupta

    SemiWiki.com· Apr 10, 2026

    Edge Cortex CEO Sakya Dasgupta explains how their Sakura 2 chip delivers 60 TOPS under 8 watts, passed NASA radiation testing, and is heading toward space deployment by 2027.

    artificial-intelligence
  • The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
    StandardSummaries only

    AI Trends 2026: OpenClaw Agents, Reasoning LLMs, and More with Sebastian Raschka

    The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)· Sebastian Raschka· Feb 26, 2026

    In this episode, Sebastian Raschka, independent LLM researcher and author, joins us to break down how the LLM landscape has changed over the past year and what is likely to matter most in 2026. We discuss the shift from

    llminference