Go Back
You can't go any further
You can't go any further
meritocratic
regular
democratic
hot
top
alive
16 posts
Goal-Directedness
4 posts
Literature Reviews
146
why assume AGIs will optimize for fixed goals?
nostalgebraist
6mo
52
91
wrapper-minds are the enemy
nostalgebraist
6mo
36
60
Finding Goals in the World Model
Jeremy Gillen
4mo
8
59
AI safety without goal-directed behavior
Rohin Shah
3y
15
45
Will humans build goal-directed agents?
Rohin Shah
3y
43
35
When Most VNM-Coherent Preference Orderings Have Convergent Instrumental Incentives
TurnTrout
1y
4
23
P₂B: Plan to P₂B Better
Ramana Kumar
1y
14
21
Goal-directed = Model-based RL?
adamShimi
2y
10
18
Focus: you are allowed to be bad at accomplishing your goals
adamShimi
2y
17
15
Locality of goals
adamShimi
2y
8
14
Goals and short descriptions
Michele Campolo
2y
8
13
Against the Backward Approach to Goal-Directedness
adamShimi
1y
6
13
Behavioral Sufficient Statistics for Goal-Directedness
adamShimi
1y
12
11
Goal-Directedness and Behavior, Redux
adamShimi
1y
4
175
2021 AI Alignment Literature Review and Charity Comparison
Larks
12mo
26
137
2020 AI Alignment Literature Review and Charity Comparison
Larks
1y
14
69
Literature Review on Goal-Directedness
adamShimi
1y
21
9
Some recent survey papers on (mostly near-term) AI safety, security, and assurance
Aryeh Englander
1y
0