Episode Details

Back to Episodes
CEO “AI psychosis” and layoffs & Legal and coding benchmarks reality - AI News (May 28, 2026)

CEO “AI psychosis” and layoffs & Legal and coding benchmarks reality - AI News (May 28, 2026)

Published 3 weeks, 4 days ago
Description
Please support this podcast by checking out our sponsors:
- Lindy is your ultimate AI assistant that proactively manages your inbox - https://try.lindy.ai/tad
- Consensus: AI for Research. Get a free month - https://get.consensus.app/automated_daily
- SurveyMonkey, Using AI to surface insights faster and reduce manual analysis time - https://get.surveymonkey.com/tad


Support The Automated Daily directly:
Buy me a coffee: https://buymeacoffee.com/theautomateddaily

Today's topics:

CEO “AI psychosis” and layoffs - TechCrunch spotlights “AI psychosis,” where executives over-believe agent automation after glossy demos, fueling layoffs despite mixed productivity evidence.

Legal and coding benchmarks reality - Two new yardsticks—Legal Agent Benchmark and DeepSWE—show frontier models still struggle with long-horizon, real-world work, emphasizing reliability over hype.

AI claims major math proofs - Anthropic staff say Claude Mythos can tackle the Erdős unit-distance conjecture, echoing OpenAI and DeepMind math wins and reigniting debate over tool-assisted vs “pure” LLM results.

Containing the blast radius of agents - Anthropic details agent security lessons: sandboxes, VMs, and egress controls matter because human approvals are inconsistent and attackers exploit weak boundaries.

AI transparency and anti-AI search - YouTube is making AI-content labels more prominent and adding automatic detection signals, while DuckDuckGo’s AI-free search page sees a surge amid backlash to AI-heavy results.

Customer data used for training - PostHog plans to train in-house models on customer usage data with opt-outs and regional defaults, highlighting the privacy tradeoffs behind “smarter” product features.

GPU tuning, compute, and geopolitics - NVIDIA’s CompileIQ aims to squeeze extra GPU performance via compiler auto-tuning, while SpaceX’s S-1 raises questions about terrestrial vs orbital AI compute—and China tightens travel rules for top AI staff.

Better image generation and AI fluency - Microsoft’s MAI-Image-2.5 climbs leaderboards with better text-in-image control, and Anthropic is reportedly building an AI Fluency scorecard to evaluate how humans use AI, not just how AI performs.



-TechCrunch: CEOs’ ‘AI psychosis’ may be driving overconfident automation and layoffs
-Anthropic’s Claude Mythos Reportedly Reproduces OpenAI’s Erdős Unit-Distance Breakthrough
-Legal Agent Benchmark Early Results Show Low Pass Rates and High Cost for Frontier Models
-PostHog to Train In-House AI Models on Customer Data, With Opt-Out Controls
-Microsoft Launches MAI-Image-2.5, Debuting No. 3 on Arena Text-to-Image Leaderboard
-You.com Guide Warns API Latency Benchmarks Mislead Buyers
-
Listen Now