Go Back
Choose this branch
Choose this branch
meritocratic
regular
democratic
hot
top
alive
14 posts
Threat Models
Sharp Left Turn
18 posts
Coordination / Cooperation
Fiction
Site Meta
AI Risk Concrete Stories
AMA
Multipolar Scenarios
Prisoner's Dilemma
Paperclip Maximizer
Moloch
Q&A (format)
38
AI Neorealism: a threat model & success criterion for existential safety
davidad
5d
0
42
Refining the Sharp Left Turn threat model, part 2: applying alignment techniques
Vika
25d
4
214
A central AI alignment problem: capabilities generalization, and the sharp left turn
So8res
6mo
48
28
AI X-risk >35% mostly based on a recent peer-reviewed argument
michaelcohen
1mo
31
59
Refining the Sharp Left Turn threat model, part 1: claims and mechanisms
Vika
4mo
3
33
We may be able to see sharp left turns coming
Ethan Perez
3mo
26
180
Another (outer) alignment failure story
paulfchristiano
1y
38
109
Less Realistic Tales of Doom
Mark Xu
1y
13
87
Rogue AGI Embodies Valuable Intellectual Property
Mark Xu
1y
9
222
What failure looks like
paulfchristiano
3y
49
58
Vignettes Workshop (AI Impacts)
Daniel Kokotajlo
1y
3
43
Distinguishing AI takeover scenarios
Sam Clarke
1y
11
51
Survey on AI existential risk scenarios
Sam Clarke
1y
11
26
Investigating AI Takeover Scenarios
Sammy Martin
1y
1
121
The next decades might be wild
Marius Hobbhahn
5d
21
132
AI coordination needs clear wins
evhub
3mo
15
255
It Looks Like You're Trying To Take Over The World
gwern
9mo
125
118
Late 2021 MIRI Conversations: AMA / Discussion
Rob Bensinger
9mo
208
115
Welcome & FAQ!
Ruby
1y
8
154
What Multipolar Failure Looks Like, and Robust Agent-Agnostic Processes (RAAPs)
Andrew_Critch
1y
60
65
We're Redwood Research, we do applied alignment research, AMA
Nate Thomas
1y
3
64
AI takeoff story: a continuation of progress by other means
Edouard Harris
1y
13
42
My Overview of the AI Alignment Landscape: Threat Models
Neel Nanda
12mo
4
76
What Failure Looks Like: Distilling the Discussion
Ben Pace
2y
14
116
Prisoners' Dilemma with Costs to Modeling
Scott Garrabrant
4y
20
48
Clarifying “What failure looks like”
Sam Clarke
2y
14
26
(apologies for Alignment Forum server outage last night)
Ruby
1y
1
91
Introducing the AI Alignment Forum (FAQ)
habryka
4y
8