32 posts
Threat Models
Coordination / Cooperation
Sharp Left Turn
Fiction
AI Risk Concrete Stories
Site Meta
AMA
Multipolar Scenarios
Prisoner's Dilemma
Moloch
Paperclip Maximizer
Q&A (format)
51 posts
World Optimization
Existential Risk
Practical
Academic Papers
Ethics & Morality
AI Safety Camp
Symbol Grounding
Security Mindset
Software Tools
Careers
Surveys
Updated Beliefs (examples of)
189
The next decades might be wild
Marius Hobbhahn
5d
21
44
AI X-risk >35% mostly based on a recent peer-reviewed argument
michaelcohen
1mo
31
67
We may be able to see sharp left turns coming
Ethan Perez
3mo
26
30
Refining the Sharp Left Turn threat model, part 2: applying alignment techniques
Vika
25d
4
292
A central AI alignment problem: capabilities generalization, and the sharp left turn
So8res
6mo
48
136
AI coordination needs clear wins
evhub
3mo
15
83
Refining the Sharp Left Turn threat model, part 1: claims and mechanisms
Vika
4mo
3
103
Welcome & FAQ!
Ruby
1y
8
120
Late 2021 MIRI Conversations: AMA / Discussion
Rob Bensinger
9mo
208
58
My Overview of the AI Alignment Landscape: Threat Models
Neel Nanda
12mo
4
82
What Failure Looks Like: Distilling the Discussion
Ben Pace
2y
14
240
Another (outer) alignment failure story
paulfchristiano
1y
38
53
Rogue AGI Embodies Valuable Intellectual Property
Mark Xu
1y
9
81
Introducing the AI Alignment Forum (FAQ)
habryka
4y
8
82
Thoughts on AGI organizations and capabilities work
Rob Bensinger
13d
17
69
Deconfusing Direct vs Amortised Optimization
beren
18d
6
80
Don't leave your fingerprints on the future
So8res
2mo
32
144
An Update on Academia vs. Industry (one year into my faculty job)
David Scott Krueger (formerly: capybaralet)
3mo
18
45
The Dumbest Possible Gets There First
Artaxerxes
4mo
7
23
Concrete Advice for Forming Inside Views on AI Safety
Neel Nanda
4mo
6
58
A survey of tool use and workflows in alignment research
Logan Riggs
9mo
5
18
Some ideas for epistles to the AI ethicists
Charlie Steiner
3mo
0
42
What technologies could cause world GDP doubling times to be <8 years?
Daniel Kokotajlo
2y
44
31
New paper: Corrigibility with Utility Preservation
Koen.Holtman
3y
11
73
AI x-risk reduction: why I chose academia over industry
David Scott Krueger (formerly: capybaralet)
1y
14
54
[Linkpost] Existential Risk Analysis in Empirical Research Papers
Dan H
5mo
0
23
Reading the ethicists 2: Hunting for AI alignment papers
Charlie Steiner
6mo
1
15
Techniques for optimizing worst-case performance
paulfchristiano
3y
12