EP47: GPT-5 Rumors, AutoGen Studio, SeeAct Web Agents, Google AMIE, Anthropic’s Sleeper Agents

Build AI Agents & Try AI Agents From The Show On SimTheory: https://simtheory.aiJoin Discord: https://discord.gg/aphwE5snuqGet Merch: https://www.thisdayinaimerch.com/DESCRIPTION====In this episode, we dive into the buzz around GPT-5, sparked by Sam Altman's revelations on Bill Gates' latest podcast. We share our top hopes and dreams for GPT-5 and future AI advancements. Next, we delve into Microsoft's new CoPilot Pro Subscription, exploring how it stands out from ChatGPT Plus. Chris takes AutoGen Studio for a spin and ponders over its ideal user base. The episode then shifts to the intriguing concept of collaborative AI agents - is this the path to AI's mastering reasoning, reflection, and profound thought? We dissect the insights from the SeeAct Web Agents study, assessing its influence on AI agent development. Shifting gears, we discuss Google AMIE's groundbreaking ability to outperform doctors in diagnoses, even those assisted by AI. To wrap up, we spotlight the significance of Anthropic's Sleeper Agents experiment and its groundbreaking findings.Thanks for listening. Please consider subscribing if you haven't already and leaving a review. We appreciate all of your support!CHAPTERS:====00:00 - Cold Open00:31 - GTP-5 Rumors & Leaks07:32 - Microsoft CoPilot Pro22:27 - Microsoft's AutoGen Studio: An open-source UI for AutoGen38:53 - The Future of AI Agents? LAMs and SeeACT Web Agent Paper1:00:19 - Google AMIE: Can AI Replace Doctors for Diagnosis?1:13:12 -Anthropic's Sleep Agents ExperimentSOURCES:====https://twitter.com/arrakis_ai/status/1745672203683942863?s=20https://twitter.com/daniacostaai/status/1746554047878824409?s=46https://blogs.microsoft.com/blog/2024/01/15/bringing-the-full-power-of-copilot-to-more-people-and-businesses/https://twitter.com/emollick/status/1747359731595763817https://microsoft.github.io/autogen/blog/2023/12/01/AutoGenStudio/https://osu-nlp-group.github.io/SeeAct/https://blog.research.google/2024/01/amie-research-ai-system-for-diagnostic_12.htmlhttps://www.bloomberg.com/news/articles/2024-01-14/artificial-intelligence-will-affect-almost-40-of-jobs-imf-sayshttps://twitter.com/Teknium1/status/1746067427379798344PAPERS:====https://arxiv.org/pdf/2401.01614.pdfhttps://arxiv.org/pdf/2401.05654.pdfhttps://arxiv.org/pdf/2401.05566.pdf

Om Podcasten

Join Michael and Chris Sharkey, two proudly average tech enthusiasts, as they stumble through the world of artificial intelligence with all the grace of a robot learning to dance. This (sometimes weekly*) podcast delivers an hour-long conversation about their thoroughly middle-of-the-road adventures with AI. No PhDs. No Silicon Valley insights. Just two guys with enough technical knowledge to be dangerous, sharing their unexceptional yet entertaining experiences with AI tools and technology. Subscribe now to hear: • Mediocre hot takes on AI developments • Stories of AI experiments gone adequately okay • The most average advice you'll ever need • Two Sharkeys trying their best to sound smart about algorithms • Childish AI prank calls that somehow fool everybody • Attempts at using AI for phishing attacks on their mother • "Chart-topping" AI songs according to the brothers Join our perfectly mediocre community where being average at AI is celebrated, questions are encouraged, and learning through mistakes is our specialty. Because let's face it - most of us are figuring this out as we go along. New episodes drop whenever we remember to record them. 🎙️ Proudly supported by Simtheory.ai