Podcast Episodes
Back to Search
Build Your Python Data Processing Your Way And Run It Anywhere With Fugue
Episode 266
Summary
Python has grown to be one of the top languages used for all aspects of data, from collection and cleaning, to analysis and machine learning.…
4 years ago
Bring Your Code To Your Streaming And Static Data Without Effort With The Deephaven Real Time Query Engine
Episode 264
Summary
Streaming data sources are becoming more widely available as tools to handle their storage and distribution mature. However it is still a cha…
4 years, 1 month ago
Build Your Own End To End Customer Data Platform With Rudderstack
Episode 263
Summary
Collecting, integrating, and activating data are all challenging activities. When that data pertains to your customers it can become even mor…
4 years, 1 month ago
Scale Your Spatial Analysis By Building It In SQL With Syntax Extensions
Episode 262
Summary
Along with globalization of our societies comes the need to analyze the geospatial and geotemporal data that is needed to manage the growth i…
4 years, 1 month ago
Scalable Strategies For Protecting Data Privacy In Your Shared Data Sets
Episode 261
Summary
There are many dimensions to the work of protecting the privacy of users in our data. When you need to share a data set with other teams, dep…
4 years, 1 month ago
A Reflection On Learning A Lot More Than 97 Things Every Data Engineer Should Know
Episode 260
Summary
The Data Engineering Podcast has been going for five years now and has included conversations and interviews with a huge number of guests, co…
4 years, 1 month ago
Effective Pandas Patterns For Data Engineering
Episode 259
Summary
Pandas is a powerful tool for cleaning, transforming, manipulating, or enriching data, among many other potential uses. As a result it has be…
4 years, 1 month ago
Building And Managing Data Teams And Data Platforms In Large Organizations With Ashish Mrig
Episode 257
Summary
Data engineering is a relatively young and rapidly expanding field, with practitioners having a wide array of experiences as they navigate th…
4 years, 1 month ago
The Importance Of Data Contracts As The Interface For Data Integration With Abhi Sivasailam
Episode 258
Summary
Data platforms are exemplified by a complex set of connections that are subject to a set of constantly evolving requirements. In order to mak…
4 years, 1 month ago
Automated Data Quality Management Through Machine Learning With Anomalo
Episode 256
Summary
Data quality control is a requirement for being able to trust the various reports and machine learning models that are relying on the informa…
4 years, 1 month ago