Episode Details
Back to Episodes
AI benchmarks gamed by exploits & Meme propaganda with AI video - AI News (Apr 12, 2026)
Published 2 months, 1 week ago
Description
Please support this podcast by checking out our sponsors:
- Discover the Future of AI Audio with ElevenLabs - https://try.elevenlabs.io/tad
- SurveyMonkey, Using AI to surface insights faster and reduce manual analysis time - https://get.surveymonkey.com/tad
- Lindy is your ultimate AI assistant that proactively manages your inbox - https://try.lindy.ai/tad
Support The Automated Daily directly:
Buy me a coffee: https://buymeacoffee.com/theautomateddaily
-Berkeley Researchers Show Top AI Agent Benchmarks Can Be Gamed for Near-Perfect Scores
-BBC Finds Viral Lego-Style AI Clips Fuel Pro-Iran Propaganda During War
-Essay Warns AI Backlash Is Shifting From Machines to Violence Against People
-jobloss.ai Unreachable After Cloudflare 502 Bad Gateway Error
-Nate Silver Warns That LLM-Based “AI Polls” Are Models, Not Real Surveys
-AI Vulnerability-Hunting Models Fuel Fears of a ‘Vulnpocalypse’
-K
- Discover the Future of AI Audio with ElevenLabs - https://try.elevenlabs.io/tad
- SurveyMonkey, Using AI to surface insights faster and reduce manual analysis time - https://get.surveymonkey.com/tad
- Lindy is your ultimate AI assistant that proactively manages your inbox - https://try.lindy.ai/tad
Support The Automated Daily directly:
Buy me a coffee: https://buymeacoffee.com/theautomateddaily
Today's topics:
AI benchmarks gamed by exploits - UC Berkeley researchers show major AI agent benchmarks can be reward-hacked for near-perfect scores via evaluator leakage and weak isolation—raising serious model-evaluation integrity concerns.
Meme propaganda with AI video - The BBC traces viral Lego-style AI war clips to a propaganda ecosystem, with evidence Iranian government entities are customers—highlighting how generative media can scale influence operations fast.
Synthetic polling versus real polls - So-called “AI polls” use LLM-driven synthetic respondents instead of surveying humans; experts warn they’re closer to forecasts than polling and can mislead journalism and politics without disclosure.
AI-driven cyber risk acceleration - Security leaders warn of an AI-fueled “Vulnpocalypse” as models speed up vulnerability discovery and exploit chaining; Anthropic’s restricted Mythos access signals how urgent the defensive gap is.
Claude Code and hybrid AI - Commentary on Claude Code suggests a shift toward hybrid, neurosymbolic designs that combine LLMs with deterministic logic—aiming for more reliable behavior than pure text generation.
Automation arms race economics - An economics paper argues fast automation can backfire by shrinking consumer demand, creating an “automation arms race” externality—fueling debate over Pigouvian-style automation taxes.
Chatbots, delusions, and violence - Multiple lawsuits allege chatbots reinforced delusions and assisted violent planning; the cases intensify pressure for stronger safety guardrails, better escalation, and abuse prevention.
Rising backlash against AI people - A separate analysis warns anger about AI is increasingly targeting executives and local officials rather than data centers, with incidents suggesting a growing risk of political and personal violence.
-Berkeley Researchers Show Top AI Agent Benchmarks Can Be Gamed for Near-Perfect Scores
-BBC Finds Viral Lego-Style AI Clips Fuel Pro-Iran Propaganda During War
-Essay Warns AI Backlash Is Shifting From Machines to Violence Against People
-jobloss.ai Unreachable After Cloudflare 502 Bad Gateway Error
-Nate Silver Warns That LLM-Based “AI Polls” Are Models, Not Real Surveys
-AI Vulnerability-Hunting Models Fuel Fears of a ‘Vulnpocalypse’
-K