What is Data Science? - Vicki Boykis

Data science, data engineering, data analysis, and machine learning are part of the recent massive growth of Python.

But really what is data science?

Vicki Boykis helps me understand questions like:

  • No really, what is data science?
  • What does a data pipeline look like?
  • What is it like to do data science, data analysis, data engineering?
  • Can you do analysis on a laptop?
  • How big does data have to be to be considered big?
  • What are the challenges in data science?
  • Does it make sense for software engineers to learn data engineering, data science, pipelines, etc?
  • How could someone start learning data science?

Also covered:

  • A type work (analysis) vs B type work (building)
  • data lakes and data swamps
  • predictive models
  • data cleaning
  • development vs experimentation
  • Jupyter Notebooks
  • Kaggle
  • ETL pipelines

I learned a lot about the broad field of data science from talking with Vicki.

Special Guest: Vicki Boykis.

Sponsored By:

Support Test & Code: Python Software Testing & Engineering

Links:

Om Podcasten

Topics include automated testing, testing strategy, software engineering practices, packaging, Python, pytest, data science, TDD, continuous integration, and software methodologies. Also anything I think helps make the daily life of a software developer more fun and rewarding. Hosted by Brian Okken.