Go Back
Choose this branch
Choose this branch
meritocratic
regular
democratic
hot
top
alive
344 posts
Research Agendas
Value Learning
Reinforcement Learning
Embedded Agency
Suffering
AI Capabilities
Agency
Animal Welfare
Inverse Reinforcement Learning
Risks of Astronomical Suffering (S-risks)
Wireheading
Robust Agents
14230 posts
Decision Theory
Utility Functions
Counterfactuals
Goal-Directedness
Nutrition
Newcomb's Problem
VNM Theorem
Updateless Decision Theory
Timeless Decision Theory
Literature Reviews
Functional Decision Theory
Counterfactual Mugging
281
Is AI Progress Impossible To Predict?
alyssavance
7mo
38
249
Humans are very reliable agents
alyssavance
6mo
35
218
Reward is not the optimization target
TurnTrout
4mo
97
216
On how various plans miss the hard bits of the alignment challenge
So8res
5mo
81
194
EfficientZero: How It Works
1a3orn
1y
42
158
Are wireheads happy?
Scott Alexander
12y
107
147
Introduction to Cartesian Frames
Scott Garrabrant
2y
29
146
Some conceptual alignment research projects
Richard_Ngo
3mo
14
139
EfficientZero: human ALE sample-efficiency w/MuZero+self-supervised
gwern
1y
52
129
Embedded Agents
abramdemski
4y
41
127
Demand offsetting
paulfchristiano
1y
38
117
Our take on CHAI’s research agenda in under 1500 words
Alex Flint
2y
19
111
"Just Suffer Until It Passes"
lionhearted
4y
26
105
Wirehead your Chickens
shminux
4y
53
177
Impossibility results for unbounded utilities
paulfchristiano
10mo
104
140
Saving Time
Scott Garrabrant
1y
19
138
Decision theory does not imply that we get to have nice things
So8res
2mo
53
131
Humans are utility monsters
PhilGoetz
9y
217
128
2020 AI Alignment Literature Review and Charity Comparison
Larks
1y
14
127
How I Lost 100 Pounds Using TDT
Zvi
11y
244
124
Can you control the past?
Joe Carlsmith
1y
93
119
An Orthodox Case Against Utility Functions
abramdemski
2y
53
118
Decision Theories: A Less Wrong Primer
orthonormal
10y
174
117
Pinpointing Utility
9y
156
117
The genie knows, but doesn't care
Rob Bensinger
9y
519
117
Coherent decisions imply consistent utilities
Eliezer Yudkowsky
3y
81
116
Decision Theory FAQ
lukeprog
9y
484
113
Degrees of Freedom
sarahconstantin
3y
31