Podcast Episodes

Back to Search
Performance Optimization and Software/Hardware Co-design across PyTorch, CUDA, and NVIDIA GPUs
Performance Optimization and Software/Hardware Co-design across PyTorch, CUDA, and NVIDIA GPUs

March 3rd, Computer History Museum CODING AGENTS CONFERENCE, come join us while there are still tickets left.

https://luma.com/codingagents


Chris Fregl…

4 months ago

Short Long
View Episode
Serving LLMs in Production: Performance, Cost & Scale // CAST AI Roundtable
Serving LLMs in Production: Performance, Cost & Scale // CAST AI Roundtable

Roundtable CAST AI episode: Serving LLMs in Production: Performance, Cost & Scale.


Join the Community:

https://go.mlops.community/YTJoinIn

Get the new…

4 months, 1 week ago

Short Long
View Episode
The Future of Information Retrieval: From Dense Vectors to Cognitive Search
The Future of Information Retrieval: From Dense Vectors to Cognitive Search

Rahul Raja is a Staff Software Engineer at LinkedIn, working on large-scale search infrastructure, information retrieval systems, and integrating AI/…

4 months, 1 week ago

Short Long
View Episode
Rethinking Notebooks Powered by AI
Rethinking Notebooks Powered by AI

Vincent Warmerdam is a Founding Engineer at marimo, working on reinventing Python notebooks as reactive, reproducible, interactive, and Git-friendly …

4 months, 2 weeks ago

Short Long
View Episode
Software Engineering in the Age of Coding Agents: Testing, Evals, and Shipping Safely at Scale
Software Engineering in the Age of Coding Agents: Testing, Evals, and Shipping Safely at Scale

Ereli Eran is the Founding Engineer at 7AI, where he’s focused on building and scaling the company’s agentic AI-driven cybersecurity platform — devel…

4 months, 2 weeks ago

Short Long
View Episode
Physical AI: Teaching Machines to Understand the Real World
Physical AI: Teaching Machines to Understand the Real World

Nick Gillian is the Co-Founder and CTO at Archetype AI, working on physical AI foundation models that understand and reason over real-world sensor da…

4 months, 3 weeks ago

Short Long
View Episode
Speed and Scale: How Today's AI Datacenters Are Operating Through Hypergrowth
Speed and Scale: How Today's AI Datacenters Are Operating Through Hypergrowth

Kris Beevers is the CEO at NetBox Labs, working on turning NetBox into the system of record and automation backbone for modern and AI-driven infrastr…

4 months, 3 weeks ago

Short Long
View Episode
Cracking the Black Box: Real-Time Neuron Monitoring & Causality Traces
Cracking the Black Box: Real-Time Neuron Monitoring & Causality Traces

Mike Oaten is the Founder and CEO of TIKOS, working on building AI assurance, explainability, and trustworthy AI infrastructure, helping organization…

5 months ago

Short Long
View Episode
A Playground for AI/ML Engineers
A Playground for AI/ML Engineers

Paulo Vasconcellos is the Principal Data Scientist for Generative AI Products at Hotmart, working on AI-powered creator and learning experiences, inc…

5 months ago

Short Long
View Episode
How Universal Resource Management Transforms AI Infrastructure Economics
How Universal Resource Management Transforms AI Infrastructure Economics

Wilder Lopes is the CEO and Founder of Ogre.run, working on AI-driven dependency resolution and reproducible code execution across environments.How U…

5 months, 1 week ago

Short Long
View Episode

Love PodBriefly?

If you like Podbriefly.com, please consider donating to support the ongoing development.

Support Us