Go Back
Choose this branch
Choose this branch
meritocratic
regular
democratic
hot
top
alive
32 posts
Threat Models
Coordination / Cooperation
Sharp Left Turn
Fiction
AI Risk Concrete Stories
Site Meta
AMA
Multipolar Scenarios
Prisoner's Dilemma
Moloch
Paperclip Maximizer
Q&A (format)
51 posts
World Optimization
Existential Risk
Practical
Academic Papers
Ethics & Morality
AI Safety Camp
Symbol Grounding
Security Mindset
Software Tools
Careers
Surveys
Updated Beliefs (examples of)
121
The next decades might be wild
Marius Hobbhahn
5d
21
28
AI X-risk >35% mostly based on a recent peer-reviewed argument
michaelcohen
1mo
31
33
We may be able to see sharp left turns coming
Ethan Perez
3mo
26
42
Refining the Sharp Left Turn threat model, part 2: applying alignment techniques
Vika
25d
4
214
A central AI alignment problem: capabilities generalization, and the sharp left turn
So8res
6mo
48
132
AI coordination needs clear wins
evhub
3mo
15
59
Refining the Sharp Left Turn threat model, part 1: claims and mechanisms
Vika
4mo
3
115
Welcome & FAQ!
Ruby
1y
8
118
Late 2021 MIRI Conversations: AMA / Discussion
Rob Bensinger
9mo
208
42
My Overview of the AI Alignment Landscape: Threat Models
Neel Nanda
12mo
4
76
What Failure Looks Like: Distilling the Discussion
Ben Pace
2y
14
180
Another (outer) alignment failure story
paulfchristiano
1y
38
87
Rogue AGI Embodies Valuable Intellectual Property
Mark Xu
1y
9
91
Introducing the AI Alignment Forum (FAQ)
habryka
4y
8
106
Thoughts on AGI organizations and capabilities work
Rob Bensinger
13d
17
27
Deconfusing Direct vs Amortised Optimization
beren
18d
6
106
Don't leave your fingerprints on the future
So8res
2mo
32
92
An Update on Academia vs. Industry (one year into my faculty job)
David Scott Krueger (formerly: capybaralet)
3mo
18
25
The Dumbest Possible Gets There First
Artaxerxes
4mo
7
13
Concrete Advice for Forming Inside Views on AI Safety
Neel Nanda
4mo
6
28
A survey of tool use and workflows in alignment research
Logan Riggs
9mo
5
20
Some ideas for epistles to the AI ethicists
Charlie Steiner
3mo
0
44
What technologies could cause world GDP doubling times to be <8 years?
Daniel Kokotajlo
2y
44
39
New paper: Corrigibility with Utility Preservation
Koen.Holtman
3y
11
39
AI x-risk reduction: why I chose academia over industry
David Scott Krueger (formerly: capybaralet)
1y
14
26
[Linkpost] Existential Risk Analysis in Empirical Research Papers
Dan H
5mo
0
19
Reading the ethicists 2: Hunting for AI alignment papers
Charlie Steiner
6mo
1
31
Techniques for optimizing worst-case performance
paulfchristiano
3y
12