Podcast Episodes
Building A Cost Effective Data Catalog With Tree Schema
Episode 158
Summary
A data catalog is a critical piece of infrastructure for any organization that wants to build analytics products, whether internal or external…
5 years, 4 months ago
Add Version Control To Your Data Lake With LakeFS
Episode 157
Summary
Data lakes are gaining popularity due to their flexibility and reduced cost of storage. Along with the benefits there are some additional com…
5 years, 4 months ago
Cloud Native Data Security As Code With Cyral
Episode 156
Summary
One of the most challenging aspects of building a data platform has nothing to do with pipelines and transformations. If you are putting your…
5 years, 4 months ago
Better Data Quality Through Observability With Monte Carlo
Episode 155
Summary
In order for analytics and machine learning projects to be useful, they require a high degree of data quality. To ensure that your pipelines …
5 years, 4 months ago
Rapid Delivery Of Business Intelligence Using Power BI
Episode 154
Summary
Business intelligence efforts are only as useful as the outcomes that they inform. Power BI aims to reduce the time and effort required to go…
5 years, 5 months ago
Self Service Real Time Data Integration Without The Headaches With Meroxa
Episode 153
Summary
Analytical workloads require a well-engineered and well-maintained data integration process to ensure that your information is reliable and u…
5 years, 5 months ago
Speed Up And Simplify Your Streaming Data Workloads With Red Panda
Episode 152
Summary
Kafka has become a de facto standard interface for building decoupled systems and working with streaming data. Despite its widespread popular…
5 years, 5 months ago
Cutting Through The Noise And Focusing On The Fundamentals Of Data Engineering With The Data Janitor
Episode 151
Summary
Data engineering is a constantly growing and evolving discipline. There are always new tools, systems, and design patterns to learn, which le…
5 years, 5 months ago
Distributed In Memory Processing And Streaming With Hazelcast
Episode 150
Summary
In-memory computing provides significant performance benefits, but brings along challenges for managing failures and scaling up. Hazelcast is…
5 years, 6 months ago
Simplify Your Data Architecture With The Presto Distributed SQL Engine
Episode 149
Summary
Databases are limited in scope to the information that they directly contain. For analytical use cases you often want to combine data across …
5 years, 6 months ago