Podcast Episodes
AI Accelerates: New Gemini Model + AI Unemployment Stories Analysed
Season 2 Episode 19
There’s a new best language model, so let’s go through the ups and downs of Gemini 2.5 Pro 06-05. Record-breaking common-sense, but dumb mistakes rema…
1 year ago
Claude 4: Full 120 Page Breakdown … Is it the Best New Model?
Season 2 Episode 18
Not only did I get early access and run my own tests, as per the title I read both the 120-page Claude 4 Opus and Claude 4 Sonnet System Card, and 25…
1 year ago
Google Takes No Prisoners Amid Torrent of AI Announcements
Season 2 Episode 17
Google just announced at least 12 things that are each worthy of a video, but here are the top I/O highlights. From Veo 3 to Deep Research now being …
1 year ago
AI Improves at Self-improving
Season 2 Episode 16
AlphaEvolve is not the first system to exhibit self-improvement, but it may be the most impressive yet. AI is literally improving the hardware, archi…
1 year ago
o3 breaks (some) records, but AI becomes pay-to-win
Season 2 Episode 15
A green card, o3 vs Gemini 2.5, 6 Benchmarks and a whole bunch of my thoughts on what on earth is happening in AI, from here to 2030. Plus, how AI is…
1 year, 1 month ago
o3 and o4-mini - they’re great, but easy to over-hype
Season 2 Episode 14
Critical analysis of the two most powerful new models behind ChatGPT, o3 and o4-mini. Not just the system cards, benchmarks, and my own tests, but so…
1 year, 1 month ago
‘Speaking Dolphin’ to AI Data Dominance, 4.1 + Kling 2: 7 Developments Critically Analysed
Season 2 Episode 13
This pod won’t just be about the release of GPT 4.1 in the last 48 hours, o3 build-up, Kling 2.0, a sneak peek at the next OpenAI model, or even the …
1 year, 1 month ago
AI CEO: ‘Stock Crash Could Stop AI Progress’, Llama 4 Anti-climax + ‘Superintelligence in 2027’...
Season 2 Episode 12
The latest on Llama 4, and whether it signals a slowdown in AI, or solid progress. Plus, a deep dive on that viral prediction of superintelligence by…
1 year, 2 months ago
Gemini 2.5 Pro - It’s a Smart Chatbot … (New Simple High Score)
Season 2 Episode 11
Gemini gets a new record on Simple Bench, and several other benchmarks. I’ll go deep to explore its nuances, including how it deceptively reverse eng…
1 year, 2 months ago
Did AI Just Get Commoditized? Gemini 2.5, New DeepSeek V3, & Microsoft vs OpenAI
Season 2 Episode 10
Gemini 2.5 is out, on the same day as the new DeepSeek V3 (which should power DeepSeek R2). Do both models prove AI is being commoditized? Let’s find…
1 year, 2 months ago