32 posts
Threat Models
Coordination / Cooperation
Sharp Left Turn
Fiction
AI Risk Concrete Stories
Site Meta
AMA
Multipolar Scenarios
Prisoner's Dilemma
Moloch
Paperclip Maximizer
Q&A (format)
51 posts
World Optimization
Existential Risk
Practical
Academic Papers
Ethics & Morality
AI Safety Camp
Symbol Grounding
Security Mindset
Software Tools
Careers
Surveys
Updated Beliefs (examples of)
386
It Looks Like You're Trying To Take Over The World
gwern
9mo
125
319
What failure looks like
paulfchristiano
3y
49
253
A central AI alignment problem: capabilities generalization, and the sharp left turn
So8res
6mo
48
210
Another (outer) alignment failure story
paulfchristiano
1y
38
203
What Multipolar Failure Looks Like, and Robust Agent-Agnostic Processes (RAAPs)
Andrew_Critch
1y
60
155
The next decades might be wild
Marius Hobbhahn
5d
21
134
AI coordination needs clear wins
evhub
3mo
15
119
Late 2021 MIRI Conversations: AMA / Discussion
Rob Bensinger
9mo
208
116
Prisoners' Dilemma with Costs to Modeling
Scott Garrabrant
4y
20
110
Less Realistic Tales of Doom
Mark Xu
1y
13
109
Welcome & FAQ!
Ruby
1y
8
95
Clarifying “What failure looks like”
Sam Clarke
2y
14
86
Introducing the AI Alignment Forum (FAQ)
habryka
4y
8
79
What Failure Looks Like: Distilling the Discussion
Ben Pace
2y
14
314
How To Get Into Independent Research On Alignment/Agency
johnswentworth
1y
33
270
Six Dimensions of Operational Adequacy in AGI Projects
Eliezer Yudkowsky
6mo
65
199
Some AI research areas and their relevance to existential safety
Andrew_Critch
2y
40
175
Morality is Scary
Wei_Dai
1y
125
143
Reshaping the AI Industry
Thane Ruthenis
6mo
34
135
Possible takeaways from the coronavirus pandemic for slow AI takeoff
Vika
2y
36
118
An Update on Academia vs. Industry (one year into my faculty job)
David Scott Krueger (formerly: capybaralet)
3mo
18
116
How do we prepare for final crunch time?
Eli Tyre
1y
30
95
Moral strategies at different capability levels
Richard_Ngo
4mo
14
94
List of resolved confusions about IDA
Wei_Dai
3y
18
94
Thoughts on AGI organizations and capabilities work
Rob Bensinger
13d
17
93
Don't leave your fingerprints on the future
So8res
2mo
32
88
Linkpost: Github Copilot productivity experiment
Daniel Kokotajlo
3mo
4
78
Nearcast-based "deployment problem" analysis
HoldenKarnofsky
3mo
2