Podcast Episodes
Building A Real Time Event Data Warehouse For Sentry
Episode 108
Summary
The team at Sentry has built a platform for anyone in the world to send software errors and events. As they scaled the volume of customers an…
6 years, 3 months ago
Escaping Analysis Paralysis For Your Data Platform With Data Virtualization
Episode 107
Summary
With the constant evolution of technology for data management, it can seem impossible to make an informed decision about whether to build a da…
6 years, 3 months ago
Designing For Data Protection
Episode 106
Summary
The practice of data management is one that requires technical acumen, but there are also many policy and regulatory issues that inform and i…
6 years, 4 months ago
Automating Your Production Dataflows On Spark
Episode 105
Summary
As data engineers, the health of our pipelines is our highest priority. Unfortunately, there are countless ways that our dataflows can break o…
6 years, 4 months ago
Build Maintainable And Testable Data Applications With Dagster
Episode 104
Summary
Despite the fact that businesses have relied on useful and accurate data to succeed for decades now, the state of the art for obtaining and m…
6 years, 4 months ago
Data Orchestration For Hybrid Cloud Analytics
Episode 103
Summary
The scale and complexity of the systems that we build to satisfy business requirements are increasing as the available tools become more sophi…
6 years, 4 months ago
Keeping Your Data Warehouse In Order With DataForm
Episode 102
Summary
Managing a data warehouse can be challenging, especially when trying to maintain a common set of patterns. Dataform is a platform that helps …
6 years, 5 months ago
Fast Analytics On Semi-Structured And Structured Data In The Cloud
Episode 101
Summary
The process of exposing your data through a SQL interface has many possible pathways, each with its own complications and tradeoffs. One of…
6 years, 5 months ago
Ship Faster With An Opinionated Data Pipeline Framework
Episode 100
Summary
Building an end-to-end data pipeline for your machine learning projects is a complex task, made more difficult by the variety of ways that yo…
6 years, 5 months ago
Open Source Object Storage For All Of Your Data
Episode 99
Summary
Object storage is quickly becoming the unifying layer for data-intensive applications and analytics. Modern, cloud-oriented data warehouses a…
6 years, 5 months ago