Go Back
Choose this branch
Choose this branch
meritocratic
regular
democratic
hot
top
alive
166 posts
AI Risk
Missing Moods
Seed AI
29 posts
Threat Models
Sharp Left Turn
Multipolar Scenarios
Technological Unemployment
77
AI Safety Seems Hard to Measure
HoldenKarnofsky
12d
5
455
Counterarguments to the basic AI x-risk case
KatjaGrace
2mo
122
64
Who are some prominent reasonable people who are confident that AI won't kill everyone?
Optimization Process
15d
40
1039
Where I agree and disagree with Eliezer
paulfchristiano
6mo
205
113
AI will change the world, but won’t take it over by playing “3-dimensional chess”.
boazbarak
28d
86
1043
AGI Ruin: A List of Lethalities
Eliezer Yudkowsky
6mo
653
103
Meta AI announces Cicero: Human-Level Diplomacy play (with dialogue)
Jacy Reese Anthis
28d
64
117
Am I secretly excited for AI getting weird?
porby
1mo
4
28
Apply to attend winter AI alignment workshops (Dec 28-30 & Jan 3-5) near Berkeley
Akash
19d
1
21
Aligned Behavior is not Evidence of Alignment Past a Certain Level of Intelligence
Ronny Fernandez
15d
5
87
Niceness is unnatural
So8res
2mo
18
148
Worlds Where Iterative Design Fails
johnswentworth
3mo
26
15
Race to the Top: Benchmarks for AI Safety
Isabella Duan
16d
2
170
AGI ruin scenarios are likely (and disjunctive)
So8res
4mo
37
42
AI Neorealism: a threat model & success criterion for existential safety
davidad
5d
0
148
Clarifying AI X-risk
zac_kenton
1mo
23
14
How is the "sharp left turn defined"?
Chris_Leong
12d
3
68
Threat Model Literature Review
zac_kenton
1mo
4
31
Refining the Sharp Left Turn threat model, part 2: applying alignment techniques
Vika
25d
4
309
A central AI alignment problem: capabilities generalization, and the sharp left turn
So8res
6mo
48
47
AI X-risk >35% mostly based on a recent peer-reviewed argument
michaelcohen
1mo
31
165
AI Could Defeat All Of Us Combined
HoldenKarnofsky
6mo
29
87
Refining the Sharp Left Turn threat model, part 1: claims and mechanisms
Vika
4mo
3
70
We may be able to see sharp left turns coming
Ethan Perez
3mo
26
33
It matters when the first sharp left turn happens
Adam Jermyn
2mo
9
266
What Multipolar Failure Looks Like, and Robust Agent-Agnostic Processes (RAAPs)
Andrew_Critch
1y
60
253
Another (outer) alignment failure story
paulfchristiano
1y
38
437
What failure looks like
paulfchristiano
3y
49