Results for "vision-language models"
Keyword scan across titles, descriptions, summaries, and tags. For interview listings, try Guest appearances.
3 results
Episodes
StandardSummaries onlyWaymo's Foundation Model for Autonomous Driving with Drago Anguelov
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)· Drago Anguelov· Mar 31, 2025
Today, we're joined by Drago Anguelov, head of AI foundations at Waymo, for a deep dive into the role of foundation models in autonomous driving. Drago shares how Waymo is leveraging large-scale machine learning, includi…
machine-learningmultimodalfoundation-modelsgenerative-ai
StandardSummaries onlyWhy Vision Language Models Ignore What They See with Munawar Hayat
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)· Munawar Hayat· Dec 9, 2025
In this episode, we’re joined by Munawar Hayat, researcher at Qualcomm AI Research, to discuss a series of papers presented at NeurIPS 2025 focusing on multimodal and generative AI. We dive into the persistent challenge …
multimodalgenerative-ai
StandardSummaries onlyInside Nano Banana 🍌 and the Future of Vision-Language Models with Oliver Wang
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)· Oliver Wang· Sep 23, 2025
Today, we’re joined by Oliver Wang, principal scientist at Google DeepMind and tech lead for Gemini 2.5 Flash Image—better known by its code name, “Nano Banana.” We dive into the development and capabilities of this newl…
google-aimultimodal