171: Machine Learning Pipelines Are Still Data Pipelines with Sandy Ryza of Dagster

This week on The Data Stack Show, Eric and Kostas chat with Sandy Ryza, Lead Engineer at Dagster. During the episode, Sandy shares insights on data cleaning, data engineering processes, and the need for improved tools. He introduces Dagster, an orchestrator that focuses on assets such as tables, datasets, and machine learning models, and contrasts it with traditional workflow systems. He also explains Dagster’s integration with dbt and explores the changing dynamics in data roles, the impact of modern tooling, the potential for increased creativity in the field, and more.
