Podcast Episodes
Back to Search
TimescaleDB: Fast And Scalable Timeseries with Ajay Kulkarni and Mike Freedman - Episode 18
Episode 18
Summary
As communications between machines become more commonplace the need to store the generated data in a time-oriented manner increases. The mar…
8 years ago
Pulsar: Fast And Scalable Messaging with Rajan Dhabalia and Matteo Merli - Episode 17
Episode 17
Summary
One of the critical components for modern data infrastructure is a scalable and reliable messaging system. Publish-subscribe systems have be…
8 years, 1 month ago
Dat: Distributed Versioned Data Sharing with Danielle Robinson and Joe Hand - Episode 16
Episode 16
Summary
Sharing data across multiple computers, particularly when it is large and changing, is a difficult problem to solve. In order to provide a si…
8 years, 1 month ago
Snorkel: Extracting Value From Dark Data with Alex Ratner - Episode 15
Episode 15
Summary
The majority of the conversation around machine learning and big data pertains to well-structured and cleaned data sets. Unfortunately, that…
8 years, 1 month ago
CRDTs and Distributed Consensus with Christopher Meiklejohn - Episode 14
Episode 14
Summary
As we scale our systems to handle larger volumes of data, geographically distributed users, and varied data sources the requirement to distr…
8 years, 1 month ago
Citus Data: Distributed PostGreSQL for Big Data with Ozgun Erdogan and Craig Kerstiens - Episode 13
Episode 13
Summary
PostGreSQL has become one of the most popular and widely used databases, and for good reason. The level of extensibility that it supports ha…
8 years, 2 months ago
Wallaroo with Sean T. Allen - Episode 12
Episode 12
Summary
Data oriented applications that need to operate on large, fast-moving sterams of information can be difficult to build and scale due to the …
8 years, 2 months ago
SiriDB: Scalable Open Source Timeseries Database with Jeroen van der Heijden - Episode 11
Episode 11
Summary
Time series databases have long been the cornerstone of a robust metrics system, but the existing options are often difficult to manage in p…
8 years, 2 months ago
Confluent Schema Registry with Ewen Cheslack-Postava - Episode 10
Episode 10
Summary
To process your data you need to know what shape it has, which is why schemas are important. When you are processing that data in multiple s…
8 years, 2 months ago
data.world with Bryon Jacob - Episode 9
Episode 9
Summary
We have tools and platforms for collaborating on software projects and linking them together, wouldn’t it be nice to have the same capabilit…
8 years, 3 months ago