Results for "vision-language models"

Keyword scan across titles, descriptions, summaries, and tags. For interview listings, try Guest appearances.

3 results

Episodes

  • The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
    StandardSummaries only

    Waymo's Foundation Model for Autonomous Driving with Drago Anguelov

    The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)· Drago Anguelov· Mar 31, 2025

    Today, we're joined by Drago Anguelov, head of AI foundations at Waymo, for a deep dive into the role of foundation models in autonomous driving. Drago shares how Waymo is leveraging large-scale machine learning, includi

    machine-learningmultimodalfoundation-modelsgenerative-ai
  • The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
    StandardSummaries only

    Why Vision Language Models Ignore What They See with Munawar Hayat

    The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)· Munawar Hayat· Dec 9, 2025

    In this episode, we’re joined by Munawar Hayat, researcher at Qualcomm AI Research, to discuss a series of papers presented at NeurIPS 2025 focusing on multimodal and generative AI. We dive into the persistent challenge

    multimodalgenerative-ai
  • The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
    StandardSummaries only

    Inside Nano Banana 🍌 and the Future of Vision-Language Models with Oliver Wang

    The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)· Oliver Wang· Sep 23, 2025

    Today, we’re joined by Oliver Wang, principal scientist at Google DeepMind and tech lead for Gemini 2.5 Flash Image—better known by its code name, “Nano Banana.” We dive into the development and capabilities of this newl

    google-aimultimodal