Podcast Episodes

"The Onion Test for Personal and Institutional Honesty" by Chana Messinger & Andrew Critch

https://www.lesswrong.com/posts/nTGEeRSZrfPiJwkEc/the-onion-test-for-personal-and-institutional-honesty

[co-written by Chana Messinger and Andrew Crit…

3 years, 3 months ago

Short Long

View Episode

"Lies, Damn Lies, and Fabricated Options" by Duncan Sabien

https://www.lesswrong.com/posts/gNodQGNoPDjztasbh/lies-damn-lies-and-fabricated-options

This is an essay about one of those "once you see it, you will…

3 years, 3 months ago

Short Long

View Episode

"What failure looks like" by Paul Christiano

https://www.lesswrong.com/posts/HBxe6wdjxK239zajf/what-failure-looks-like

Crossposted from the AI Alignment Forum. May contain more technical jargon t…

3 years, 3 months ago

Short Long

View Episode

"Why I think strong general AI is coming soon" by Porby

https://www.lesswrong.com/posts/K4urTDkBbtNuLivJx/why-i-think-strong-general-ai-is-coming-soon

I think there is little time left before someone builds…

3 years, 3 months ago

Short Long

View Episode

"It Looks Like You’re Trying To Take Over The World" by Gwern

https://gwern.net/fiction/clippy

In A.D. 20XX. Work was beginning. “How are you gentlemen !!”… (Work. Work never changes; work is always hell.)

Specifi…

3 years, 3 months ago

Short Long

View Episode

"More information about the dangerous capability evaluations we did with GPT-4 and Claude." by Beth Barnes

https://www.lesswrong.com/posts/4Gt42jX7RiaNaxCwP/more-information-about-the-dangerous-capability-evaluations

Crossposted from the AI Alignment Forum.…

3 years, 4 months ago

Short Long

View Episode

""Carefully Bootstrapped Alignment" is organizationally hard" by Raemon

https://www.lesswrong.com/posts/thkAtqoQwN6DtaiGT/carefully-bootstrapped-alignment-is-organizationally-hard

In addition to technical challenges, plans…

3 years, 4 months ago

Short Long

View Episode

"The Parable of the King and the Random Process" by moridinamael

https://www.lesswrong.com/posts/LzQtrHSYDafXynofq/the-parable-of-the-king-and-the-random-process

~ A Parable of Forecasting Under Model Uncertainty ~

Y…

3 years, 4 months ago

Short Long

View Episode

"Enemies vs Malefactors" by Nate Soares

https://www.lesswrong.com/posts/zidQmfFhMgwFzcHhs/enemies-vs-malefactors

Status: some mix of common wisdom (that bears repeating in our particular con…

3 years, 4 months ago

Short Long

View Episode

"The Waluigi Effect (mega-post)" by Cleo Nardo

https://www.lesswrong.com/posts/D7PumeYTDPfBTp3i7/the-waluigi-effect-mega-post

In this article, I will present a mechanistic explanation of the Waluig…

3 years, 4 months ago

Short Long

View Episode

Podcast Episodes

"The Onion Test for Personal and Institutional Honesty" by Chana Messinger & Andrew Critch

"Lies, Damn Lies, and Fabricated Options" by Duncan Sabien

"What failure looks like" by Paul Christiano

"Why I think strong general AI is coming soon" by Porby

"It Looks Like You’re Trying To Take Over The World" by Gwern

"More information about the dangerous capability evaluations we did with GPT-4 and Claude." by Beth Barnes

""Carefully Bootstrapped Alignment" is organizationally hard" by Raemon

"The Parable of the King and the Random Process" by moridinamael

"Enemies vs Malefactors" by Nate Soares

"The Waluigi Effect (mega-post)" by Cleo Nardo

Love PodBriefly?