Goal-Directedness (16 posts)

93 | wrapper-minds are the enemy | nostalgebraist | 6mo | 36
92 | why assume AGIs will optimize for fixed goals? | nostalgebraist | 6mo | 52
50 | Finding Goals in the World Model | Jeremy Gillen | 4mo | 8
69 | When Most VNM-Coherent Preference Orderings Have Convergent Instrumental Incentives | TurnTrout | 1y | 4
43 | P₂B: Plan to P₂B Better | Ramana Kumar | 1y | 14
29 | Behavioral Sufficient Statistics for Goal-Directedness | adamShimi | 1y | 12
71 | AI safety without goal-directed behavior | Rohin Shah | 3y | 15
17 | Goal-Directedness and Behavior, Redux | adamShimi | 1y | 4
25 | Against the Backward Approach to Goal-Directedness | adamShimi | 1y | 6
57 | Will humans build goal-directed agents? | Rohin Shah | 3y | 43
20 | Focus: you are allowed to be bad at accomplishing your goals | adamShimi | 2y | 17
21 | Goal-directed = Model-based RL? | adamShimi | 2y | 10
17 | Locality of goals | adamShimi | 2y | 8
14 | Goals and short descriptions | Michele Campolo | 2y | 8

Literature Reviews (4 posts)

153 | 2021 AI Alignment Literature Review and Charity Comparison | Larks | 12mo | 26
137 | 2020 AI Alignment Literature Review and Charity Comparison | Larks | 1y | 14
69 | Literature Review on Goal-Directedness | adamShimi | 1y | 21
13 | Some recent survey papers on (mostly near-term) AI safety, security, and assurance | Aryeh Englander | 1y | 0