166 posts
AI Risk
Goodhart's Law
World Optimization
Threat Models
Instrumental Convergence
Corrigibility
Existential Risk
Coordination / Cooperation
Academic Papers
AI Safety Camp
Ethics & Morality
Treacherous Turn
689 posts
Newsletters
Logical Induction
Epistemology
SERI MATS
Logical Uncertainty
Intellectual Progress (Society-Level)
Practice & Philosophy of Science
AI Alignment Fieldbuilding
Distillation & Pedagogy
Bayes' Theorem
Postmortems & Retrospectives
Radical Probabilism
155
The next decades might be wild
Marius Hobbhahn
5d
21
39
AI Neorealism: a threat model & success criterion for existential safety
davidad
5d
0
94
Thoughts on AGI organizations and capabilities work
Rob Bensinger
13d
17
58
You can still fetch the coffee today if you're dead tomorrow
davidad
11d
15
336
Counterarguments to the basic AI x-risk case
KatjaGrace
2mo
122
103
AI will change the world, but won’t take it over by playing “3-dimensional chess”.
boazbarak
28d
86
48
Deconfusing Direct vs Amortised Optimization
beren
18d
6
724
AGI Ruin: A List of Lethalities
Eliezer Yudkowsky
6mo
653
36
Refining the Sharp Left Turn threat model, part 2: applying alignment techniques
Vika
25d
4
98
Niceness is unnatural
So8res
2mo
18
93
Don't leave your fingerprints on the future
So8res
2mo
32
144
Worlds Where Iterative Design Fails
johnswentworth
3mo
26
253
A central AI alignment problem: capabilities generalization, and the sharp left turn
So8res
6mo
48
134
AI coordination needs clear wins
evhub
3mo
15
124
Logical induction for software engineers
Alex Flint
17d
2
31
Reflections on the PIBBSS Fellowship 2022
Nora_Ammann
9d
0
207
Lessons learned from talking to >100 academics about AI safety
Marius Hobbhahn
2mo
16
161
Most People Start With The Same Few Bad Ideas
johnswentworth
3mo
30
119
Quintin's alignment papers roundup - week 1
Quintin Pope
3mo
5
135
Your posts should be on arXiv
JanBrauner
3mo
39
71
SERI MATS Program - Winter 2022 Cohort
Ryan Kidd
2mo
12
63
QAPR 4: Inductive biases
Quintin Pope
2mo
2
119
Conjecture: Internal Infohazard Policy
Connor Leahy
4mo
6
84
How to do theoretical research, a personal perspective
Mark Xu
4mo
4
60
Quintin's alignment papers roundup - week 2
Quintin Pope
3mo
2
192
Call For Distillers
johnswentworth
8mo
42
28
Auditing games for high-level interpretability
Paul Colognese
1mo
1
54
Methodological Therapy: An Agenda For Tackling Research Bottlenecks
adamShimi
2mo
6