Podcast Episodes

Model Context Protocol (MCP)

We cover Anthropic’s groundbreaking Model Context Protocol (MCP). Though it was released in November 2024, we've been seeing a lot of hype around it lately, and thought it was well worth digging into…

Published on 5 months, 1 week ago

AI Roundup: DeepSeek’s Big Moves, Claude 3.7, and the Latest Breakthroughs

This week, we're mixing things up a little bit. Instead of diving deep into a single research paper, we cover the biggest AI developments from the past few weeks.

We break down key announcements, incl…

Published on 6 months, 1 week ago

How DeepSeek is Pushing the Boundaries of AI Development

This week, we dive into DeepSeek. SallyAnn DeLucia, Product Manager at Arize, and Nick Luzio, a Solutions Engineer, break down key insights on a model that has been dominating headlines for its significa…

Published on 6 months, 2 weeks ago

Multiagent Finetuning: A Conversation with Researcher Yilun Du

We talk to Google DeepMind Senior Research Scientist (and incoming Assistant Professor at Harvard), Yilun Du, about his latest paper, "Multiagent Finetuning: Self Improvement with Diverse Reasoning C…

Published on 7 months ago

Training Large Language Models to Reason in Continuous Latent Space

LLMs have typically been restricted to reason in the "language space," where chain-of-thought (CoT) is used to solve complex reasoning problems. But a new paper argues that language space may not alw…

Published on 7 months, 3 weeks ago

LLMs as Judges: A Comprehensive Survey on LLM-Based Evaluation Methods

We discuss a major survey of work and research on LLM-as-Judge from the last few years. "LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods" systematically examines the LLMs-as-Ju…

Published on 8 months, 2 weeks ago

Merge, Ensemble, and Cooperate! A Survey on Collaborative LLM Strategies

LLMs have revolutionized natural language processing, showcasing remarkable versatility and capabilities. But individual LLMs often exhibit distinct strengths and weaknesses, influenced by difference…

Published on 8 months, 4 weeks ago

Agent-as-a-Judge: Evaluate Agents with Agents

This week, we break down the “Agent-as-a-Judge” framework—a new agent evaluation paradigm that’s kind of like getting robots to grade each other’s homework. Where typical evaluation methods focus sol…

Published on 9 months, 2 weeks ago

Introduction to OpenAI's Realtime API

We break down OpenAI’s Realtime API. Learn how to seamlessly integrate powerful language models into your applications for instant, context-aware responses that drive user engagement. Whether you’re …

Published on 9 months, 3 weeks ago

Swarm: OpenAI's Experimental Approach to Multi-Agent Systems

As multi-agent systems grow in importance for fields ranging from customer support to autonomous decision-making, OpenAI has introduced Swarm, an experimental framework that simplifies the process of…

Published on 10 months, 1 week ago
