Our natively multimodal AI model Gemini is capable of reasoning across text, images, audio, video and code. Here are favorite moments with Gemini Learn more and try the model: Explore Gemini: For the purposes of this demo, latency has been reduced and Gemini outputs have been shortened for brevity. Subscribe to our Channel: Tweet with us on X: Follow us on Instagram: Join us on Facebook: 0:00 Intro 0:19 Multimodal Dialogue 1:32 Multilinguality 2:04 Game Creation 2:31 Visual Puzzles 3:17 Making Connections 3:39 Image & Text Generation 4:06 Logic & Spatial Reasoning 4:55 Translating Visuals 5:27 Cultural Understanding
Hide player controls
Hide resume playing