E111 - How to build a LLM - Ariel Ekgren

The 111th episode of the AI After Work Podcast features Ariel Ekgren, a distinguished Research Scientist focused on developing Large Language Models (LLMs) for Sweden and the Nordics. Ekgren, who is both a Research Scientist and Tech Lead at AI Sweden, shares insights on the breakthroughs in deep learning and Natural Language Understanding. The episode delves into various topics, such as the impact of GPT decoder-only architecture, reasoning in GPT models, the Q* algorithm's progress towards AGI, and the creation and challenges of GPT-SW3, a specialized LLM for the Nordic region. Additionally, the discussion covers potential use cases for GPT-SW3, the benefits of multilingual versus region-specific models, future steps for GPT-SW3, the future of AI in Sweden, and speculations on whether AGI might lead to a dystopian or utopian future.Follow us on youtube: https://www.youtube.com/@aiawpodcast

Om Podcasten

The Artificial Intelligence After Work (AIAW) podcast is a weekly live streamed long format conversation aiming to demystify data innovation and AI, as well as their impact to future business and society by bringing the listeners close to the challenges that AI practitioners aim to solve today. The case-study, industry-by-industry, human-focused, and guest personal angle on the topic approach makes the podcast educational, emotional, engaging, and entertaining to all who are interested in learning more about AI, the future developments in the area, or simply getting exposed to variety of topics from practitioners and experts with first-hand industry experience and knowledge in the topic of the day. Hosts: Anders Arpteg & Henrik Göthberg. Program Manager: Goran Cvetanovski