Podcast Episodes

Back to Search
No image available

Model Context Protocol (MCP)



We cover Anthropic’s groundbreaking Model Context Protocol (MCP). Though it was released in November 2024, we've been seeing a lot of hype around it lately, and thought it was well worth digging into…


Published on 5 months, 1 week ago

No image available

AI Roundup: DeepSeek’s Big Moves, Claude 3.7, and the Latest Breakthroughs



This week, we're mixing things up a little bit. Instead of diving deep into a single research paper, we cover the biggest AI developments from the past few weeks.

We break down key announcements, incl…


Published on 6 months, 1 week ago

No image available

How DeepSeek is Pushing the Boundaries of AI Development



This week, we dive into DeepSeek. SallyAnn DeLucia, Product Manager at Arize, and Nick Luzio, a Solutions Engineer, break down key insights on a model that have dominating headlines for its significa…


Published on 6 months, 2 weeks ago

No image available

Multiagent Finetuning: A Conversation with Researcher Yilun Du



We talk to Google DeepMind Senior Research Scientist (and incoming Assistant Professor at Harvard), Yilun Du, about his latest paper, "Multiagent Finetuning: Self Improvement with Diverse Reasoning C…


Published on 7 months ago

No image available

Training Large Language Models to Reason in Continuous Latent Space



LLMs have typically been restricted to reason in the "language space," where chain-of-thought (CoT) is used to solve complex reasoning problems. But a new paper argues that language space may not alw…


Published on 7 months, 3 weeks ago

No image available

LLMs as Judges: A Comprehensive Survey on LLM-Based Evaluation Methods



We discuss a major survey of work and research on LLM-as-Judge from the last few years. "LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods" systematically examines the LLMs-as-Ju…


Published on 8 months, 2 weeks ago

No image available

Merge, Ensemble, and Cooperate! A Survey on Collaborative LLM Strategies



LLMs have revolutionized natural language processing, showcasing remarkable versatility and capabilities. But individual LLMs often exhibit distinct strengths and weaknesses, influenced by difference…


Published on 8 months, 4 weeks ago

No image available

Agent-as-a-Judge: Evaluate Agents with Agents



This week, we break down the “Agent-as-a-Judge” framework—a new agent evaluation paradigm that’s kind of like getting robots to grade each other’s homework. Where typical evaluation methods focus sol…


Published on 9 months, 2 weeks ago

No image available

Introduction to OpenAI's Realtime API



We break down OpenAI’s realtime API. Learn how to seamlessly integrate powerful language models into your applications for instant, context-aware responses that drive user engagement. Whether you’re …


Published on 9 months, 3 weeks ago

No image available

Swarm: OpenAI's Experimental Approach to Multi-Agent Systems



As multi-agent systems grow in importance for fields ranging from customer support to autonomous decision-making, OpenAI has introduced Swarm, an experimental framework that simplifies the process of…


Published on 10 months, 1 week ago





If you like Podbriefly.com, please consider donating to support the ongoing development.

Donate