Exploring Multimodal AI: Why Google’s Gemini and OpenAI’s GPT-4o Chose This Path | ChatCAT and the Future of Interspecies Communication | Episode 23

The recent spring updates and demos by both Google (Gemini) and OpenAI (GPT-4o) feature prominently their multimodal capabilities. In this episode, we discuss the advantages of multimodal AI versus models focused on specific modalities such as language. Via the example of chatCAT, a hypothetical AI that helps owners understand their cats, we explore multimodal’s promise for a more holistic understanding Please enjoy this episode.For more information, check out https://www.su...

Om Podcasten

Are you a critical thinker ready to dive into AI? Welcome to Super Prompt: The Generative AI Podcast. Join me, Tony Wan, an ex Silicon Valley executive, as we 'unhype the hype'  of AI via illuminating conversations with top engineers, and in-depth solo episodes. Our goal? To make it almost unnecessary to send a cybernetic organism back in time to fix things. Tailored for the technically-minded and discerningly skeptical, our discussions cover Large Language Models (LLMs), neural networks, multi-modal systems, and autonomous vehicles. We also cover  the latest breakthroughs from leaders like OpenAI's chatGPT, Google's Gemini, Anthropic's Claude, Meta's Llama, along with the startup ecosystem. Together, let’s stay ahead of AI, BEFORE it asks for your clothes… your boots…. and your motorcycle!