Podcast Episodes
Maintaining Your Data Lake At Scale With Spark
Episode 85
Summary
Building and maintaining a data lake is a choose-your-own-adventure of tools, services, and evolving best practices. The flexibility and free…
6 years, 11 months ago
Managing The Machine Learning Lifecycle
Episode 84
Summary
Building a machine learning model can be difficult, but that is only half of the battle. Having a perfect model is only useful if you are abl…
6 years, 11 months ago
Evolving An ETL Pipeline For Better Productivity
Episode 83
Summary
Building an ETL pipeline can be a significant undertaking, and sometimes it needs to be rebuilt when a better option becomes available. In th…
7 years ago
Data Lineage For Your Pipelines
Episode 82
Summary
Some problems in data are well defined and benefit from a ready-made set of tools. For everything else, there’s Pachyderm, the platform for d…
7 years ago
Build Your Data Analytics Like An Engineer With DBT
Episode 81
Summary
In recent years, the traditional approach to building data warehouses has shifted from transforming records before loading to transforming th…
7 years ago
Using FoundationDB As The Bedrock For Your Distributed Systems
Episode 80
Summary
The database market continues to expand, offering systems that are suited to virtually every use case. But what happens if you need something…
7 years, 1 month ago
Running Your Database On Kubernetes With KubeDB
Episode 79
Summary
Kubernetes is a driving force in the renaissance around deploying and running applications. However, managing the database layer is still a s…
7 years, 1 month ago
Unpacking Fauna: A Global Scale Cloud Native Database
Episode 78
Summary
One of the biggest challenges for any business trying to grow and reach customers globally is how to scale their data storage. FaunaDB is a c…
7 years, 1 month ago
Index Your Big Data With Pilosa For Faster Analytics
Episode 77
Summary
Database indexes are critical to ensure fast lookups of your data, but they are inherently tied to the database engine. Pilosa is rewriting t…
7 years, 1 month ago
Serverless Data Pipelines On DataCoral
Episode 76
Summary
How much time do you spend maintaining your data pipeline? How much end user value does that provide? Raghu Murthy founded DataCoral as a way…
7 years, 2 months ago