63 posts
Decision Theory
Utility Functions
Quantilization
Mild Optimization
Bounded Rationality
Coherence Arguments
Law-Thinking
Orthogonality Thesis
Coherent Extrapolated Volition
Indexical Information
20 posts
Goal-Directedness
Literature Reviews
147
Can you control the past?
Joe Carlsmith
1y
93
47
Take 7: You should talk about "the human's utility function" less.
Charlie Steiner
12d
22
24
Quantilizers and Generative Models
Adam Jermyn
5mo
5
55
Notes on "Can you control the past"
So8res
2mo
40
21
Exploring Mild Behaviour in Embedded Agents
Megan Kinniment
5mo
3
63
Three ways that "Sufficiently optimized agents appear coherent" can be false
Wei_Dai
3y
3
52
Buridan's ass in coordination games
jessicata
4y
26
14
Modal Bargaining Agents
orthonormal
7y
0
25
Quantilizers maximize expected utility subject to a conservative cost constraint
jessicata
7y
0
20
Another view of quantilizers: avoiding Goodhart's Law
jessicata
6y
1
24
In memoryless Cartesian environments, every UDT policy is a CDT+SIA policy
jessicata
6y
5
2
Thoughts on Quantilizers
Stuart_Armstrong
5y
0
14
Quantilal control for finite MDPs
Vanessa Kosoy
4y
0
69
The "Measuring Stick of Utility" Problem
johnswentworth
6mo
22
92
wrapper-minds are the enemy
nostalgebraist
6mo
36
6
Goal-directedness is behavioral, not structural
adamShimi
2y
12
51
Will humans build goal-directed agents?
Rohin Shah
3y
43
164
2021 AI Alignment Literature Review and Charity Comparison
Larks
12mo
26
33
P₂B: Plan to P₂B Better
Ramana Kumar
1y
14
11
Some recent survey papers on (mostly near-term) AI safety, security, and assurance
Aryeh Englander
1y
0
16
Locality of goals
adamShimi
2y
8
52
When Most VNM-Coherent Preference Orderings Have Convergent Instrumental Incentives
TurnTrout
1y
4
21
Goal-directed = Model-based RL?
adamShimi
2y
10
19
Focus: you are allowed to be bad at accomplishing your goals
adamShimi
2y
17
14
Goal-Directedness and Behavior, Redux
adamShimi
1y
4
19
Against the Backward Approach to Goal-Directedness
adamShimi
1y
6
69
Literature Review on Goal-Directedness
adamShimi
1y
21
14
Goals and short descriptions
Michele Campolo
2y
8