Results for "vision-language models"
2 results
Episodes
Why Vision Language Models Ignore What They See with Munawar Hayat
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) · Munawar Hayat · Dec 9, 2025
In this episode, we’re joined by Munawar Hayat, researcher at Qualcomm AI Research, to discuss a series of papers presented at NeurIPS 2025 focusing on multimodal and generative AI. We dive into the persistent challenge …
multimodal, generative-ai
Inside Nano Banana 🍌 and the Future of Vision-Language Models with Oliver Wang
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) · Oliver Wang · Sep 23, 2025
Today, we’re joined by Oliver Wang, principal scientist at Google DeepMind and tech lead for Gemini 2.5 Flash Image—better known by its code name, “Nano Banana.” We dive into the development and capabilities of this newl…
google-ai, multimodal