Results for "AI inference"
Keyword scan across titles, descriptions, summaries, and tags. For interview listings, try Guest appearances.
15 results
Episodes
StandardSummaries onlyDataflow Computing for AI Inference with Kunle Olukotun
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)· Kunle Olukotun· Oct 14, 2025
In this episode, we're joined by Kunle Olukotun, professor of electrical engineering and computer science at Stanford University and co-founder and chief technologist at Sambanova Systems, to discuss reconfigurable dataf…
inference
Intelligent report11AM Hour: Vista Equity Partners CEO Robert Smith, Honeywell Aerospace CEO Ahead of Spinoff & Medtronic CEO on Earnings 6/3/26
Squawk on the Street· Jun 4, 2026
M&A veteran Paul Taubman tempers deal-making optimism, Vista's Robert Smith unveils low-cost AI inference infrastructure, and Space X targets a record-shattering $75B IPO as markets navigate Iran tensions and alt-manager…
inferenceinvesting
StandardSummaries onlyScaling Agentic Inference Across Heterogeneous Compute with Zain Asgar
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)· Zain Asgar· Dec 2, 2025
In this episode, Zain Asgar, co-founder and CEO of Gimlet Labs, joins us to discuss the heterogeneous AI inference across diverse hardware. Zain argues that the current industry standard of running all AI workloads on hi…
llmagentsinference
StandardSummaries onlyMultimodal AI Models on Apple Silicon with MLX with Prince Canuma
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)· Prince Canuma· Aug 26, 2025
Today, we're joined by Prince Canuma, an ML engineer and open-source developer focused on optimizing AI inference on Apple Silicon devices. Prince shares his journey to becoming one of the most prolific contributors to A…
multimodalinference
StandardSummaries onlyHow to Engineer AI Inference Systems with Philip Kiely
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)· Philip Kiely· Apr 30, 2026
In this episode, Philip Kiely, head of AI education at Baseten, joins us to unpack the fast-evolving discipline of inference engineering. We explore why inference has become the stickiest and most critical workload in AI…
inference
StandardSummaries onlyClosing the Loop Between AI Training and Inference with Lin Qiao
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)· Lin Qiao· Aug 12, 2025
In this episode, we're joined by Lin Qiao, CEO and co-founder of Fireworks AI. Drawing on key lessons from her time building PyTorch, Lin shares her perspective on the modern generative AI development lifecycle. She expl…
pytorchinferencegenerative-ai
StandardSummaries onlyAccelerating AI Training and Inference with AWS Trainium2 with Ron Diamant
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)· Ron Diamant· Feb 24, 2025
Today, we're joined by Ron Diamant, chief architect for Trainium at Amazon Web Services, to discuss hardware acceleration for generative AI and the design and role of the recently released Trainium2 chip. We explore the …
inferencegenerative-ai
Intelligent reportInterviewing For A Job At Anthropic? DON’T Use AI.
Tech Brew Ride Home· Jun 1, 2026
Nvidia unveils ARM-based RTX Spark chips and a trillion-parameter desktop supercomputer, Minimax launches a coding model at 1/40th Anthropic's price, and Anthropic bans AI in its own hiring process.
anthropic
StandardSummaries onlyJensen Huang LIVE: Nvidia's Future, Physical AI, Rise of the Agent, Inference Explosion, AI PR Crisis
All-In with Chamath, Jason, Sacks & Friedberg· Mar 19, 2026
(0:00) Jensen Huang joins the show! (0:26) Acquiring Groq and the inference explosion (8:53) Decision making at the world's most valuable company (10:47) Physical AI's $50T market, OpenClaw's future, the new operating sy…
anthropicagentsinference
StandardSummaries onlyHigh-Efficiency Diffusion Models for On-Device Image Generation and Editing with Hung Bui
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)· Hung Bui· Oct 28, 2025
In this episode, Hung Bui, Technology Vice President at Qualcomm, joins us to explore the latest high-efficiency techniques for running generative AI, particularly diffusion models, on-device. We dive deep into the techn…
inferencegenerative-ai
StandardSummaries onlyNVIDIA's AI Engineers: Agent Inference at Planetary Scale and "Speed of Light" — Nader Khalil (Brev), Kyle Kranen (Dynamo)
Latent Space: The AI Engineer Podcast· Mar 10, 2026
Join Kyle, Nader, Vibhu, and swyx live at NVIDIA GTC next week!Now that AIE Europe tix are ~sold out, our attention turns to Miami and World’s Fair!The definitive AI Accelerator chip company has more than 10xed this AI S…
inference
StandardSummaries only20VC: Brex Acquired for $5.15BN | a16z Companies are 2/3 AI Revenues | Anthropic Inference Costs Skyrocket | OpenEvidence Raises at $12BN Valuation | The IPO Market: EquipmentShare, Wealthfront and Ethos Insurance
The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch· Jan 29, 2026
AGENDA: 03:36 Brex Acquisition by Capital One for $5.15BN 10:54 Does Brex's Acquisition Help or Hurt Ramp? 16:28 TikTok Deal Completed: Who Won & Who Lost: Analysis 19:30 Anthropic Inference Costs Higher Than Expected 37…
anthropicinferenceinvesting
StandardSummaries onlyThe AI Sec-Pocalypse Is Actually Nigh?
Tech Brew Ride Home· May 11, 2026
Google confirms AI-assisted zero-day exploit, OpenAI launches $4B+ deployment company, Apple refines Liquid Glass, TikTok goes ad-free in UK, and why agentic AI will reshape compute infrastructure.
openaiagentsinference
Intelligent reportPodcast EP339: Unique Scalable, Power-Efficient AI Technology from EdgeCortix with Dr. Sakya Dasgupta
SemiWiki.com· Apr 10, 2026
Edge Cortex CEO Sakya Dasgupta explains how their Sakura 2 chip delivers 60 TOPS under 8 watts, passed NASA radiation testing, and is heading toward space deployment by 2027.
artificial-intelligence
StandardSummaries onlyAI Trends 2026: OpenClaw Agents, Reasoning LLMs, and More with Sebastian Raschka
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)· Sebastian Raschka· Feb 26, 2026
In this episode, Sebastian Raschka, independent LLM researcher and author, joins us to break down how the LLM landscape has changed over the past year and what is likely to matter most in 2026. We discuss the shift from …
llminference