Podcast Episodes
Scale Your Analytics On The ClickHouse Data Warehouse
Episode 88
Summary
The market for data warehouse platforms is large and varied, with options for every use case. ClickHouse is an open source, column-oriented d…
6 years, 8 months ago
Stress Testing Kafka And Cassandra For Real-Time Anomaly Detection
Episode 87
Summary
Anomaly detection is a capability that is useful in a variety of problem domains, including finance, internet of things, and systems monitori…
6 years, 8 months ago
The Workflow Engine For Data Engineers And Data Scientists
Episode 86
Summary
Building a data platform that works equally well for data engineering and data science is a task that requires familiarity with the needs of …
6 years, 8 months ago
Maintaining Your Data Lake At Scale With Spark
Episode 85
Summary
Building and maintaining a data lake is a choose-your-own-adventure of tools, services, and evolving best practices. The flexibility and free…
6 years, 8 months ago
Managing The Machine Learning Lifecycle
Episode 84
Summary
Building a machine learning model can be difficult, but that is only half of the battle. Having a perfect model is only useful if you are abl…
6 years, 9 months ago
Evolving An ETL Pipeline For Better Productivity
Episode 83
Summary
Building an ETL pipeline can be a significant undertaking, and sometimes it needs to be rebuilt when a better option becomes available. In th…
6 years, 9 months ago
Data Lineage For Your Pipelines
Episode 82
Summary
Some problems in data are well defined and benefit from a ready-made set of tools. For everything else, there’s Pachyderm, the platform for d…
6 years, 9 months ago
Build Your Data Analytics Like An Engineer With DBT
Episode 81
Summary
In recent years, the traditional approach to building data warehouses has shifted from transforming records before loading to transforming th…
6 years, 9 months ago
Using FoundationDB As The Bedrock For Your Distributed Systems
Episode 80
Summary
The database market continues to expand, offering systems that are suited to virtually every use case. But what happens if you need something…
6 years, 10 months ago
Running Your Database On Kubernetes With KubeDB
Episode 79
Summary
Kubernetes is a driving force in the renaissance around deploying and running applications. However, managing the database layer is still a s…
6 years, 10 months ago