1014 posts
AI
AI Timelines
Value Learning
AI Takeoff
Embedded Agency
Community
Eliciting Latent Knowledge (ELK)
Reinforcement Learning
Infra-Bayesianism
Counterfactuals
Logic & Mathematics
Interviews
111 posts
Iterated Amplification
Game Theory
Factored Cognition
Humans Consulting HCH
Research Agendas
Ought
Debate (AI safety technique)
Risks of Astronomical Suffering (S-risks)
Center on Long-Term Risk (CLR)
Mechanism Design
Fairness
Group Rationality
503
(My understanding of) What Everyone in Technical Alignment is Doing and Why
Thomas Larsen
3mo
83
486
What 2026 looks like
Daniel Kokotajlo
1y
98
409
Discussion with Eliezer Yudkowsky on AGI interventions
Rob Bensinger
1y
257
334
EfficientZero: How It Works
1a3orn
1y
42
315
Two-year update on my personal AI timelines
Ajeya Cotra
4mo
60
297
Why Agent Foundations? An Overly Abstract Explanation
johnswentworth
9mo
54
296
Are we in an AI overhang?
Andy Jones
2y
109
276
Fun with +12 OOMs of Compute
Daniel Kokotajlo
1y
78
271
Reward is not the optimization target
TurnTrout
4mo
97
265
The Plan
johnswentworth
1y
77
265
Ngo and Yudkowsky on alignment difficulty
Eliezer Yudkowsky
1y
143
265
An overview of 11 proposals for building safe advanced AI
evhub
2y
36
263
DeepMind: Generally capable agents emerge from open-ended play
Daniel Kokotajlo
1y
53
258
Draft report on AI timelines
Ajeya Cotra
2y
56
285
On how various plans miss the hard bits of the alignment challenge
So8res
5mo
81
181
Some conceptual alignment research projects
Richard_Ngo
3mo
14
174
Unifying Bargaining Notions (1/2)
Diffractor
4mo
38
143
The Commitment Races problem
Daniel Kokotajlo
3y
39
132
Paul's research agenda FAQ
zhukeepa
4y
73
125
«Boundaries», Part 1: a key missing concept from utility theory
Andrew_Critch
4mo
26
121
Thoughts on Human Models
Ramana Kumar
3y
32
114
My Understanding of Paul Christiano's Iterated Amplification AI Safety Research Agenda
Chi Nguyen
2y
21
114
Supervise Process, not Outcomes
stuhlmueller
8mo
8
111
Debate update: Obfuscated arguments problem
Beth Barnes
1y
21
102
Our take on CHAI’s research agenda in under 1500 words
Alex Flint
2y
19
93
Ought: why it matters and ways to help
paulfchristiano
3y
7
91
Writeup: Progress on AI Safety via Debate
Beth Barnes
2y
18
84
Unifying Bargaining Notions (2/2)
Diffractor
4mo
11