Go Back
Choose this branch
Choose this branch
meritocratic
regular
democratic
hot
top
alive
69 posts
Debate (AI safety technique)
Factored Cognition
Experiments
Ought
AI-assisted Alignment
Memory and Mnemonics
Air Conditioning
43 posts
Iterated Amplification
Humans Consulting HCH
118
Godzilla Strategies
johnswentworth
6mo
65
116
Supervise Process, not Outcomes
stuhlmueller
8mo
8
107
Solving Math Problems by Relay
bgold
2y
26
106
Preregistration: Air Conditioner Test
johnswentworth
8mo
64
102
Imitative Generalisation (AKA 'Learning the Prior')
Beth Barnes
1y
14
91
Writeup: Progress on AI Safety via Debate
Beth Barnes
2y
18
84
Rant on Problem Factorization for Alignment
johnswentworth
4mo
48
80
Air Conditioner Test Results & Discussion
johnswentworth
6mo
38
79
Beliefs and Disagreements about Automating Alignment Research
Ian McKenzie
3mo
4
77
Why I'm excited about Debate
Richard_Ngo
1y
12
76
Ought: why it matters and ways to help
paulfchristiano
3y
7
67
Experiment: a good researcher is hard to find
gwern
10y
21
66
Three mental images from thinking about AGI debate & corrigibility
Steven Byrnes
2y
35
63
Vaniver's View on Factored Cognition
Vaniver
3y
4
132
Debate update: Obfuscated arguments problem
Beth Barnes
1y
21
117
My Understanding of Paul Christiano's Iterated Amplification AI Safety Research Agenda
Chi Nguyen
2y
21
111
Paul's research agenda FAQ
zhukeepa
4y
73
85
Model splintering: moving from one imperfect model to another
Stuart_Armstrong
2y
10
68
Garrabrant and Shah on human modeling in AGI
Rob Bensinger
1y
10
61
Relaxed adversarial training for inner alignment
evhub
3y
28
60
Relating HCH and Logical Induction
abramdemski
2y
4
56
Directions and desiderata for AI alignment
paulfchristiano
3y
1
50
Notes on OpenAI’s alignment plan
Alex Flint
12d
5
48
HCH Speculation Post #2A
Charlie Steiner
1y
7
47
My confusions with Paul's Agenda
Vaniver
4y
1
45
HCH is not just Mechanical Turk
William_S
3y
6
45
Understanding Iterated Distillation and Amplification: Claims and Oversight
William_S
4y
30
44
Iterated Distillation and Amplification
Ajeya Cotra
4y
13