Podcast Episodes
Interactive Exploratory Data Analysis On Petabyte Scale Data Sets With Arkouda
Episode 312
Summary
Exploratory data analysis works best when the feedback loop is fast and iterative. This is easy to achieve when you are working on small data…
3 years, 10 months ago
What "Data Lineage Done Right" Looks Like And How They're Doing It At Manta
Episode 311
Summary
Data lineage is the roadmap for your data platform, providing visibility into all of the dependencies for any report, machine learning model,…
3 years, 10 months ago
Re-Bundling The Data Stack With Data Orchestration And Software Defined Assets Using Dagster
Episode 310
Summary
The current stage of evolution in the data management ecosystem has resulted in domain and use case specific orchestration capabilities being…
3 years, 10 months ago
Writing The Book That Offers A Single Reference For The Fundamentals Of Data Engineering
Episode 309
Summary
Data engineering is a difficult job, requiring a large number of skills that often don’t overlap. Any effort to understand how to start a car…
3 years, 10 months ago
Joe Reis Flips The Script And Interviews Tobias Macey About The Data Engineering Podcast
Episode 308
Summary
Data engineering is a large and growing subject, with new technologies, specializations, and "best practices" emerging at an accelerating pac…
3 years, 10 months ago
Making The Total Cost Of Ownership For External Data Manageable With Crux
Episode 307
Summary
There are extensive and valuable data sets that are available outside the bounds of your organization. Whether that data is public, paid, or …
3 years, 10 months ago
Charting the Path of Riskified's Data Platform Journey
Episode 306
Summary
Building a data platform is a journey, not a destination. Beyond the work of assembling a set of technologies and building integrations acros…
3 years, 11 months ago
Maintain Your Data Engineers' Sanity By Embracing Automation
Episode 305
Summary
Building and maintaining reliable data assets is the prime directive for data engineers. While it is easy to say, it is endlessly complex to …
3 years, 11 months ago
Be Confident In Your Data Integration By Quickly Validating Matching Records With data-diff
Episode 304
Summary
The perennial challenge of data engineers is ensuring that information is integrated reliably. While it is straightforward to know whether a …
3 years, 11 months ago
The View From The Lakehouse Of Architectural Patterns For Your Data Platform
Episode 303
Summary
The ecosystem for data tools has been going through rapid and constant evolution over the past several years. These technological shifts have…
3 years, 11 months ago