Podcast Episodes
Back to Search
Making Spark Cloud Native At Data Mechanics
Episode 184
Summary
Spark is one of the most well-known frameworks for data processing, whether for batch or streaming, ETL or ML, and at any scale. Because of i…
5 years, 1 month ago
The Grand Vision And Present Reality of DataOps
Episode 183
Summary
The Data industry is changing rapidly, and one of the most active areas of growth is automation of data workflows. Taking cues from the DevOp…
5 years, 1 month ago
Self Service Data Exploration And Dashboarding With Superset
Episode 182
Summary
The reason for collecting, cleaning, and organizing data is to make it usable by the organization. One of the most common and widely used m…
5 years, 1 month ago
Moving Machine Learning Into The Data Pipeline at Cherre
Episode 181
Summary
Most of the time when you think about a data pipeline or ETL job what comes to mind is a purely mechanistic progression of functions that mov…
5 years, 1 month ago
Exploring The Expanding Landscape Of Data Professions with Josh Benamram of Databand
Episode 180
Summary
"Business as usual" is changing, with more companies investing in data as a first class concern. As a result, the data team is growing and in…
5 years, 2 months ago
Put Your Whole Data Team On The Same Page With Atlan
Episode 179
Summary
One of the biggest obstacles to success in delivering data products is cross-team collaboration. Part of the problem is the difference in the…
5 years, 2 months ago
Data Quality Management For The Whole Team With Soda Data
Episode 178
Summary
Data quality is on the top of everyone’s mind recently, but getting it right is as challenging as ever. One of the contributing factors is th…
5 years, 2 months ago
Real World Change Data Capture At Datacoral
Episode 177
Summary
The world of business is becoming increasingly dependent on information that is accurate up to the minute. For analytical systems, the only w…
5 years, 2 months ago
Managing The DoorDash Data Platform
Episode 176
Summary
The team at DoorDash has a complex set of optimization challenges to deal with using data that they collect from a multi-sided marketplace. I…
5 years, 2 months ago
Leave Your Data Where It Is And Automate Feature Extraction With Molecula
Episode 175
Summary
A majority of the time spent in data engineering is copying data between systems to make the information available for different purposes. Th…
5 years, 3 months ago