E111 - How to build a LLM - Ariel Ekgren

The 111th episode of the AI After Work Podcast features Ariel Ekgren, a distinguished Research Scientist focused on developing Large Language Models (LLMs) for Sweden and the Nordics. Ekgren, who is both a Research Scientist and Tech Lead at AI Sweden, shares insights on the breakthroughs in deep learning and Natural Language Understanding. The episode delves into various topics, such as the impact of GPT decoder-only architecture, reasoning in GPT models, the Q* algorithm's progress towards ...

Om Podcasten