Gen AI pilots fail, GPT-5's hidden prompt revealed, reasoning model flaws and Claude closing chats

Is enterprise AI in danger? In episode 69 of Mixture of Experts, host Tim Hwang is joined by Marina Danilevsky, Nathalie Baracaldo and Sandi Besen to debrief MIT’s report on gen AI pilots. Next, GPT-5 has a hidden system prompt? Then, we revisit the conversation about chain of thought (CoT) reasoning with our researchers. Are large reasoning models not thinking straight? Finally, Anthropic announced Claude will close down "distressing” conversations and we debate AI welfare. All that and more on today’s episode of Mixture of Experts. 00:00 – Intro 1:13 – US Open, Meta restructuring Superintelligence lab and Robot Olympics 3:11 – Gen AI pilots fail 11:09 – GPT-5's hidden prompt revealed 22:47 – Reasoning model flaws 33:55 – Claude closing chats  The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity. Subscribe to the Think newsletter → https://www.ibm.com/account/reg/us-en/signup?formid=news-urx-52120 Learn more about artificial intelligence → https://www.ibm.com/think/artificial-intelligence Visit Mixture of Experts podcast page to get more AI content → https://www.ibm.com/think/podcasts/mixture-of-experts 

Om Podcasten

Welcome to Mixture of Experts, your weekly deep dive into the ever-evolving landscape of artificial intelligence—bringing you insightful discussions on the latest AI trends, innovations, and their impact on business. From breakthrough research to practical applications, each episode offers a balanced blend of expertise and analysis. Explore how AI is reshaping industries, driving efficiency, and unlocking new opportunities for growth. Whether you're a seasoned professional seeking to stay ahead of the curve or an enthusiast curious about the future of technology, Mixture of Experts delivers the perfect mix of insights and practical knowledge. Tune in and stay informed as we navigate the dynamic intersection of AI and business.