Podcast Episodes
Exploring The Design And Benefits Of The Modern Data Stack
Episode 203
Summary
We have been building platforms and workflows to store, process, and analyze data since the earliest days of computing. Over that time there …
4 years, 11 months ago
Democratize Data Cleaning Across Your Organization With Trifacta
Episode 202
Summary
Every data project, whether it’s analytics, machine learning, or AI, starts with the work of data cleaning. This is a critical step and benef…
4 years, 11 months ago
Stick All Of Your Systems And Data Together With SaaSGlue As Your Workflow Manager
Episode 201
Summary
At the core of every data pipeline is a workflow manager (or several). Deploying, managing, and scaling that orchestration can consume a lar…
4 years, 11 months ago
Leveling Up Open Source Data Integration With Meltano Hub And The Singer SDK
Episode 200
Summary
Data integration in the form of extract and load is the critical first step of every data project. There are a large number of commercial and…
4 years, 11 months ago
A Candid Exploration Of Timeseries Data Analysis With InfluxDB
Episode 199
Summary
While the overall concept of timeseries data is uniform, its usage and applications are far from it. One of the most demanding applications o…
4 years, 11 months ago
Lessons Learned From The Pipeline Data Engineering Academy
Episode 198
Summary
Data Engineering is a broad and constantly evolving topic, which makes it difficult to teach in a concise and effective manner. Despite that,…
4 years, 11 months ago
Make Database Performance Optimization A Playful Experience With OtterTune
Episode 197
Summary
The database is the core of any system because it holds the data that drives your entire experience. We spend countless hours designing the d…
4 years, 11 months ago
Bring Order To The Chaos Of Your Unstructured Data Assets With Unstruk
Episode 196
Summary
Working with unstructured data has typically been a motivation for a data lake. The challenge is imposing enough order on the platform to mak…
5 years ago
Accelerating ML Training And Delivery With In-Database Machine Learning
Episode 195
Summary
When you build a machine learning model, the first step is always to load your data. Typically this means downloading files from object stora…
5 years ago
Taking A Tour Of The Google Cloud Platform For Data And Analytics
Episode 194
Summary
Google pioneered an impressive number of the architectural underpinnings of the broader big data ecosystem. Now they offer the technologies t…
5 years ago