Podcast Episodes
Back to Search
Automating Your Production Dataflows On Spark
Episode 105
Summary
As data engineers the health of our pipelines is our highest priority. Unfortunately, there are countless ways that our dataflows can break o…
6 years, 7 months ago
Build Maintainable And Testable Data Applications With Dagster
Episode 104
Summary
Despite the fact that businesses have relied on useful and accurate data to succeed for decades now, the state of the art for obtaining and m…
6 years, 7 months ago
Data Orchestration For Hybrid Cloud Analytics
Episode 103
Summary
The scale and complexity of the systems that we build to satisfy business requirements is increasing as the available tools become more sophi…
6 years, 7 months ago
Keeping Your Data Warehouse In Order With DataForm
Episode 102
Summary
Managing a data warehouse can be challenging, especially when trying to maintain a common set of patterns. Dataform is a platform that helps …
6 years, 7 months ago
Fast Analytics On Semi-Structured And Structured Data In The Cloud
Episode 101
Summary
The process of exposing your data through a SQL interface has many possible pathways, each with their own complications and tradeoffs. One of…
6 years, 8 months ago
Ship Faster With An Opinionated Data Pipeline Framework
Episode 100
Summary
Building an end-to-end data pipeline for your machine learning projects is a complex task, made more difficult by the variety of ways that yo…
6 years, 8 months ago
Open Source Object Storage For All Of Your Data
Episode 99
Summary
Object storage is quickly becoming the unifying layer for data intensive applications and analytics. Modern, cloud oriented data warehouses a…
6 years, 8 months ago
Navigating Boundless Data Streams With The Swim Kernel
Episode 98
Summary
The conventional approach to analytics involves collecting large amounts of data that can be cleaned, followed by a separate step for analysi…
6 years, 8 months ago
Building A Reliable And Performant Router For Observability Data
Episode 97
Summary
The first stage in every data project is collecting information and routing it to a storage system for later analysis. For operational data t…
6 years, 8 months ago
Building A Community For Data Professionals at Data Council
Episode 96
Summary
Data professionals are working in a domain that is rapidly evolving. In order to stay current we need access to deeply technical presentation…
6 years, 9 months ago