Podcast Episodes

Stan Miasnikov, Distinguished Engineer, AI/ML Architecture, Consumer Experience at Verizon Walks Us Through His New Paper



This episode dives into "Category-Theoretic Analysis of Inter-Agent Communication and Mutual Understanding Metric in Recursive Consciousness." The paper presents an extension of the Recursive Conscio…


Published 1 day, 8 hours ago

Small Language Models are the Future of Agentic AI



We had the privilege of hosting Peter Belcak – an AI Researcher working on the reliability and efficiency of agentic systems at NVIDIA – who walked us through his new paper making the rounds in AI ci…


Published 2 days, 10 hours ago

Watermarking for LLMs and Image Models



In this AI research paper reading, we dive into "A Watermark for Large Language Models" with the paper's author John Kirchenbauer. 

This paper is a timely exploration of techniques for embedding invis…


Published 1 month, 1 week ago

Self-Adapting Language Models: Paper Authors Discuss Implications



The authors of the new paper *Self-Adapting Language Models (SEAL)* shared a behind-the-scenes look at their work, motivations, results, and future directions.

The paper introduces a novel method for …


Published 1 month, 4 weeks ago

The Illusion of Thinking: What the Apple AI Paper Says About LLM Reasoning



This week we discuss The Illusion of Thinking, a new paper from researchers at Apple that challenges today’s evaluation methods and introduces a new benchmark: synthetic puzzles with controllable com…


Published 2 months, 2 weeks ago

Accurate KV Cache Quantization with Outlier Tokens Tracing



We discuss Accurate KV Cache Quantization with Outlier Tokens Tracing, a deep dive into improving the efficiency of LLM inference. The authors enhance KV Cache quantization, a technique for reducing …


Published 3 months ago

Scalable Chain of Thoughts via Elastic Reasoning



In this week's episode, we talk about Elastic Reasoning, a novel framework designed to enhance the efficiency and scalability of large reasoning models by explicitly separating the reasoning process …


Published 3 months, 3 weeks ago

Sleep-time Compute: Beyond Inference Scaling at Test-time



What if your LLM could think ahead—preparing answers before questions are even asked?

In this week's paper read, we dive into a groundbreaking new paper from researchers at Letta, introducing sleep-ti…


Published 4 months ago

LibreEval: The Largest Open Source Benchmark for RAG Hallucination Detection



For this week's paper read, we dive into our own research.

We wanted to create a replicable, evolving dataset that can keep pace with model training so that you always know you're testing with data yo…


Published 4 months, 2 weeks ago

AI Benchmark Deep Dive: Gemini 2.5 and Humanity's Last Exam



This week we talk about modern AI benchmarks, taking a close look at Google's recent Gemini 2.5 release and its performance on key evaluations, notably Humanity's Last Exam (HLE). In the session we …


Published 5 months ago
