Podcast Episodes

AISN #24: Kissinger Urges US-China Cooperation on AI, China’s New AI Law, US Export Controls, International Institutions, and Open Source AI.

Welcome to the AI Safety Newsletter by the Center for AI Safety. We discuss developments in AI and AI safety. No technical background required.

China'…

2 years, 7 months ago

Short Long

View Episode

AISN #23: New OpenAI Models, News from Anthropic, and Representation Engineering.

Welcome to the AI Safety Newsletter by the Center for AI Safety. We discuss developments in AI and AI safety. No technical background required.

OpenAI…

2 years, 7 months ago

Short Long

View Episode

AISN #23: New OpenAI Models, News from Anthropic, and Representation Engineering.

Welcome to the AI Safety Newsletter by the Center for AI Safety. We discuss developments in AI and AI safety. No technical background required.

OpenAI…

2 years, 7 months ago

Short Long

View Episode

AISN #22: Hearings, Frameworks, Bills, and Laws.

Welcome to the AI Safety Newsletter by the Center for AI Safety. We discuss developments in AI and AI safety. No technical background required.

This w…

2 years, 8 months ago

Short Long

View Episode

AISN #21: Google DeepMind’s GPT-4 Competitor, Military Investments in Autonomous Drones, The UK AI Safety Summit, and Case Studies in AI Policy.

Welcome to the AI Safety Newsletter by the Center for AI Safety. We discuss developments in AI and AI safety. No technical background required.

Google…

2 years, 8 months ago

Short Long

View Episode

AISN #20: LLM Proliferation, AI Deception, and Continuing Drivers of AI Capabilities.

AI Deception: Examples, Risks, Solutions

AI deception is the topic of a new paper from researchers at and affiliated with the Center for AI Safety. It…

2 years, 9 months ago

Short Long

View Episode

[Paper] “An Overview of Catastrophic AI Risks” by Dan Hendrycks, Mantas Mazeika and Thomas Woodside

Rapid advancements in artificial intelligence (AI) have sparked growing concerns among experts, policymakers, and world leaders regarding the potenti…

2 years, 9 months ago

Short Long

View Episode

[Paper] “An Overview of Catastrophic AI Risks” by Dan Hendrycks, Mantas Mazeika and Thomas Woodside

Rapid advancements in artificial intelligence (AI) have sparked growing concerns among experts, policymakers, and world leaders regarding the potenti…

2 years, 9 months ago

Short Long

View Episode

[Paper] “X-Risk Analysis for AI Research” by Dan Hendrycks and Mantas Mazeika

Artificial intelligence (AI) has the potential to greatly improve society, but as with any powerful technology, it comes with heightened risks and re…

2 years, 9 months ago

Short Long

View Episode

[Paper] “Unsolved Problems in ML Safety” by Dan Hendrycks, Nicholas Carlini, John Schulman and Jacob Steinhardt

Machine learning (ML) systems are rapidly increasing in size, are acquiring new capabilities, and are increasingly deployed in high-stakes settings. …

2 years, 9 months ago

Short Long

View Episode

Podcast Episodes

AISN #24: Kissinger Urges US-China Cooperation on AI, China’s New AI Law, US Export Controls, International Institutions, and Open Source AI.

AISN #23: New OpenAI Models, News from Anthropic, and Representation Engineering.

AISN #23: New OpenAI Models, News from Anthropic, and Representation Engineering.

AISN #22: Hearings, Frameworks, Bills, and Laws.

AISN #21: Google DeepMind’s GPT-4 Competitor, Military Investments in Autonomous Drones, The UK AI Safety Summit, and Case Studies in AI Policy.

AISN #20: LLM Proliferation, AI Deception, and Continuing Drivers of AI Capabilities.

[Paper] “An Overview of Catastrophic AI Risks” by Dan Hendrycks, Mantas Mazeika and Thomas Woodside

[Paper] “An Overview of Catastrophic AI Risks” by Dan Hendrycks, Mantas Mazeika and Thomas Woodside

[Paper] “X-Risk Analysis for AI Research” by Dan Hendrycks and Mantas Mazeika

[Paper] “Unsolved Problems in ML Safety” by Dan Hendrycks, Nicholas Carlini, John Schulman and Jacob Steinhardt

Love PodBriefly?