Podcast Episodes
AISN #24: Kissinger Urges US-China Cooperation on AI, China’s New AI Law, US Export Controls, International Institutions, and Open Source AI.
Welcome to the AI Safety Newsletter by the Center for AI Safety. We discuss developments in AI and AI safety. No technical background required.
China'…
2 years, 7 months ago
AISN #23: New OpenAI Models, News from Anthropic, and Representation Engineering.
Welcome to the AI Safety Newsletter by the Center for AI Safety. We discuss developments in AI and AI safety. No technical background required.
OpenAI…
2 years, 7 months ago
AISN #22: Hearings, Frameworks, Bills, and Laws.
Welcome to the AI Safety Newsletter by the Center for AI Safety. We discuss developments in AI and AI safety. No technical background required.
This w…
2 years, 8 months ago
AISN #21: Google DeepMind’s GPT-4 Competitor, Military Investments in Autonomous Drones, The UK AI Safety Summit, and Case Studies in AI Policy.
Welcome to the AI Safety Newsletter by the Center for AI Safety. We discuss developments in AI and AI safety. No technical background required.
Google…
2 years, 8 months ago
AISN #20: LLM Proliferation, AI Deception, and Continuing Drivers of AI Capabilities.
AI Deception: Examples, Risks, Solutions
AI deception is the topic of a new paper from researchers at and affiliated with the Center for AI Safety. It…
2 years, 9 months ago
[Paper] “An Overview of Catastrophic AI Risks” by Dan Hendrycks, Mantas Mazeika and Thomas Woodside
Rapid advancements in artificial intelligence (AI) have sparked growing concerns among experts, policymakers, and world leaders regarding the potenti…
2 years, 9 months ago
[Paper] “X-Risk Analysis for AI Research” by Dan Hendrycks and Mantas Mazeika
Artificial intelligence (AI) has the potential to greatly improve society, but as with any powerful technology, it comes with heightened risks and re…
2 years, 9 months ago
[Paper] “Unsolved Problems in ML Safety” by Dan Hendrycks, Nicholas Carlini, John Schulman and Jacob Steinhardt
Machine learning (ML) systems are rapidly increasing in size, are acquiring new capabilities, and are increasingly deployed in high-stakes settings. …
2 years, 9 months ago