Go Back
Choose this branch
Choose this branch
meritocratic
regular
democratic
hot
top
alive
63 posts
Decision Theory
Utility Functions
Quantilization
Mild Optimization
Bounded Rationality
Coherence Arguments
Law-Thinking
Orthogonality Thesis
Coherent Extrapolated Volition
Indexical Information
20 posts
Goal-Directedness
Literature Reviews
134
Can you control the past?
Joe Carlsmith
1y
93
59
Take 7: You should talk about "the human's utility function" less.
Charlie Steiner
12d
22
28
Quantilizers and Generative Models
Adam Jermyn
5mo
5
64
Notes on "Can you control the past"
So8res
2mo
40
10
Exploring Mild Behaviour in Embedded Agents
Megan Kinniment
5mo
3
78
Three ways that "Sufficiently optimized agents appear coherent" can be false
Wei_Dai
3y
3
66
Buridan's ass in coordination games
jessicata
4y
26
15
Modal Bargaining Agents
orthonormal
7y
0
32
Quantilizers maximize expected utility subject to a conservative cost constraint
jessicata
7y
0
26
Another view of quantilizers: avoiding Goodhart's Law
jessicata
6y
1
29
In memoryless Cartesian environments, every UDT policy is a CDT+SIA policy
jessicata
6y
5
2
Thoughts on Quantilizers
Stuart_Armstrong
5y
0
18
Quantilal control for finite MDPs
Vanessa Kosoy
4y
0
82
The "Measuring Stick of Utility" Problem
johnswentworth
6mo
22
93
wrapper-minds are the enemy
nostalgebraist
6mo
36
5
Goal-directedness is behavioral, not structural
adamShimi
2y
12
57
Will humans build goal-directed agents?
Rohin Shah
3y
43
153
2021 AI Alignment Literature Review and Charity Comparison
Larks
12mo
26
43
P₂B: Plan to P₂B Better
Ramana Kumar
1y
14
13
Some recent survey papers on (mostly near-term) AI safety, security, and assurance
Aryeh Englander
1y
0
17
Locality of goals
adamShimi
2y
8
69
When Most VNM-Coherent Preference Orderings Have Convergent Instrumental Incentives
TurnTrout
1y
4
21
Goal-directed = Model-based RL?
adamShimi
2y
10
20
Focus: you are allowed to be bad at accomplishing your goals
adamShimi
2y
17
17
Goal-Directedness and Behavior, Redux
adamShimi
1y
4
25
Against the Backward Approach to Goal-Directedness
adamShimi
1y
6
69
Literature Review on Goal-Directedness
adamShimi
1y
21
14
Goals and short descriptions
Michele Campolo
2y
8