Podcast Episodes
Back to Search
Scale Your Spatial Analysis By Building It In SQL With Syntax Extensions
Episode 262
Summary
Along with globalization of our societies comes the need to analyze the geospatial and geotemporal data that is needed to manage the growth i…
4 years, 4 months ago
Scalable Strategies For Protecting Data Privacy In Your Shared Data Sets
Episode 261
Summary
There are many dimensions to the work of protecting the privacy of users in our data. When you need to share a data set with other teams, dep…
4 years, 4 months ago
A Reflection On Learning A Lot More Than 97 Things Every Data Engineer Should Know
Episode 260
Summary
The Data Engineering Podcast has been going for five years now and has included conversations and interviews with a huge number of guests, co…
4 years, 4 months ago
Effective Pandas Patterns For Data Engineering
Episode 259
Summary
Pandas is a powerful tool for cleaning, transforming, manipulating, or enriching data, among many other potential uses. As a result it has be…
4 years, 4 months ago
Building And Managing Data Teams And Data Platforms In Large Organizations With Ashish Mrig
Episode 257
Summary
Data engineering is a relatively young and rapidly expanding field, with practitioners having a wide array of experiences as they navigate th…
4 years, 4 months ago
The Importance Of Data Contracts As The Interface For Data Integration With Abhi Sivasailam
Episode 258
Summary
Data platforms are exemplified by a complex set of connections that are subject to a set of constantly evolving requirements. In order to mak…
4 years, 4 months ago
Automated Data Quality Management Through Machine Learning With Anomalo
Episode 256
Summary
Data quality control is a requirement for being able to trust the various reports and machine learning models that are relying on the informa…
4 years, 5 months ago
An Introduction To Data And Analytics Engineering For Non-Programmers
Episode 255
Summary
Applications of data have grown well beyond the venerable business intelligence dashboards that organizations have relied on for decades. Now…
4 years, 5 months ago
Open Source Reverse ETL For Everyone With Grouparoo
Episode 254
Summary
Reverse ETL is a product category that evolved from the landscape of customer data platforms with a number of companies offering their own im…
4 years, 5 months ago
Data Observability Out Of The Box With Metaplane
Episode 253
Summary
Data observability is a set of technical and organizational capabilities related to understanding how your data is being processed and used s…
4 years, 5 months ago