Go Back
Choose this branch
Choose this branch
meritocratic
regular
democratic
hot
top
alive
63 posts
Decision Theory
Utility Functions
Quantilization
Mild Optimization
Bounded Rationality
Coherence Arguments
Law-Thinking
Orthogonality Thesis
Coherent Extrapolated Volition
Indexical Information
20 posts
Goal-Directedness
Literature Reviews
35
Take 7: You should talk about "the human's utility function" less.
Charlie Steiner
12d
22
46
Notes on "Can you control the past"
So8res
2mo
40
160
Can you control the past?
Joe Carlsmith
1y
93
56
The "Measuring Stick of Utility" Problem
johnswentworth
6mo
22
32
Exploring Mild Behaviour in Embedded Agents
Megan Kinniment
5mo
3
20
Quantilizers and Generative Models
Adam Jermyn
5mo
5
130
An Orthodox Case Against Utility Functions
abramdemski
2y
53
74
Coherence arguments imply a force for goal-directed behavior
KatjaGrace
1y
27
206
Realism about rationality
Richard_Ngo
4y
145
28
Inferring utility functions from locally non-transitive preferences
Jan
10mo
15
133
Decision Theory
abramdemski
4y
46
99
Utility ≠ Reward
vlad_m
3y
25
96
A Critique of Functional Decision Theory
wdmacaskill
3y
54
90
Comparison of decision theories (with a focus on logical-counterfactual decision theories)
riceissa
3y
11
146
why assume AGIs will optimize for fixed goals?
nostalgebraist
6mo
52
60
Finding Goals in the World Model
Jeremy Gillen
4mo
8
91
wrapper-minds are the enemy
nostalgebraist
6mo
36
175
2021 AI Alignment Literature Review and Charity Comparison
Larks
12mo
26
137
2020 AI Alignment Literature Review and Charity Comparison
Larks
1y
14
69
Literature Review on Goal-Directedness
adamShimi
1y
21
35
When Most VNM-Coherent Preference Orderings Have Convergent Instrumental Incentives
TurnTrout
1y
4
23
P₂B: Plan to P₂B Better
Ramana Kumar
1y
14
59
AI safety without goal-directed behavior
Rohin Shah
3y
15
45
Will humans build goal-directed agents?
Rohin Shah
3y
43
11
Goal-Directedness and Behavior, Redux
adamShimi
1y
4
13
Behavioral Sufficient Statistics for Goal-Directedness
adamShimi
1y
12
21
Goal-directed = Model-based RL?
adamShimi
2y
10
13
Against the Backward Approach to Goal-Directedness
adamShimi
1y
6