Go Back
You can't go any further
You can't go any further
meritocratic
regular
democratic
hot
top
alive
16 posts
Goal-Directedness
4 posts
Literature Reviews
119
why assume AGIs will optimize for fixed goals?
nostalgebraist
6mo
52
92
wrapper-minds are the enemy
nostalgebraist
6mo
36
65
AI safety without goal-directed behavior
Rohin Shah
3y
15
55
Finding Goals in the World Model
Jeremy Gillen
4mo
8
52
When Most VNM-Coherent Preference Orderings Have Convergent Instrumental Incentives
TurnTrout
1y
4
51
Will humans build goal-directed agents?
Rohin Shah
3y
43
33
P₂B: Plan to P₂B Better
Ramana Kumar
1y
14
21
Goal-directed = Model-based RL?
adamShimi
2y
10
21
Behavioral Sufficient Statistics for Goal-Directedness
adamShimi
1y
12
19
Focus: you are allowed to be bad at accomplishing your goals
adamShimi
2y
17
19
Against the Backward Approach to Goal-Directedness
adamShimi
1y
6
16
Locality of goals
adamShimi
2y
8
14
Goal-Directedness and Behavior, Redux
adamShimi
1y
4
14
Goals and short descriptions
Michele Campolo
2y
8
164
2021 AI Alignment Literature Review and Charity Comparison
Larks
12mo
26
137
2020 AI Alignment Literature Review and Charity Comparison
Larks
1y
14
69
Literature Review on Goal-Directedness
adamShimi
1y
21
11
Some recent survey papers on (mostly near-term) AI safety, security, and assurance
Aryeh Englander
1y
0