Podcast Episodes
Back to Search
Navigating Boundless Data Streams With The Swim Kernel
Episode 98
Summary
The conventional approach to analytics involves collecting large amounts of data that can be cleaned, followed by a separate step for analysi…
6 years, 5 months ago
Building A Reliable And Performant Router For Observability Data
Episode 97
Summary
The first stage in every data project is collecting information and routing it to a storage system for later analysis. For operational data t…
6 years, 6 months ago
Building A Community For Data Professionals at Data Council
Episode 96
Summary
Data professionals are working in a domain that is rapidly evolving. In order to stay current we need access to deeply technical presentation…
6 years, 6 months ago
Building Tools And Platforms For Data Analytics
Episode 95
Summary
Data engineers are responsible for building tools and platforms to power the workflows of other members of the business. Each group of users …
6 years, 6 months ago
A High Performance Platform For The Full Big Data Lifecycle
Episode 94
Summary
Managing big data projects at scale is a perennial problem, with a wide variety of solutions that have evolved over the past 20 years. One of…
6 years, 6 months ago
Digging Into Data Replication At Fivetran
Episode 93
Summary
The extract and load pattern of data replication is the most commonly needed process in data engineering workflows. Because of the myriad sou…
6 years, 7 months ago
Solving Data Discovery At Lyft
Episode 92
Summary
Data is only valuable if you use it for something, and the first step is knowing that it is available. As organizations grow and data sources…
6 years, 7 months ago
Simplifying Data Integration Through Eventual Connectivity
Episode 91
Summary
The ETL pattern that has become commonplace for integrating data from multiple sources has proven useful, but complex to maintain. For a smal…
6 years, 7 months ago
Straining Your Data Lake Through A Data Mesh
Episode 90
Summary
The current trend in data management is to centralize the responsibilities of storing and curating the organization’s information to a data e…
6 years, 7 months ago
Data Labeling That You Can Feel Good About With CloudFactory
Episode 89
Summary
Successful machine learning and artificial intelligence projects require large volumes of data that is properly labelled. The challenge is th…
6 years, 8 months ago