Tree of Tags

16 posts Goal-Directedness

4 posts Literature Reviews

93 wrapper-minds are the enemy

nostalgebraist

6mo

36

92 why assume AGIs will optimize for fixed goals?

nostalgebraist

6mo

52

50 Finding Goals in the World Model

Jeremy Gillen

4mo

8

69 When Most VNM-Coherent Preference Orderings Have Convergent Instrumental Incentives

TurnTrout

1y

4

43 P₂B: Plan to P₂B Better

Ramana Kumar

1y

14

29 Behavioral Sufficient Statistics for Goal-Directedness

adamShimi

1y

12

71 AI safety without goal-directed behavior

Rohin Shah

3y

15

17 Goal-Directedness and Behavior, Redux

adamShimi

1y

4

25 Against the Backward Approach to Goal-Directedness

adamShimi

1y

6

57 Will humans build goal-directed agents?

Rohin Shah

3y

43

20 Focus: you are allowed to be bad at accomplishing your goals

adamShimi

2y

17

21 Goal-directed = Model-based RL?

adamShimi

2y

10

17 Locality of goals

adamShimi

2y

8

14 Goals and short descriptions

Michele Campolo

2y

8

153 2021 AI Alignment Literature Review and Charity Comparison

Larks

12mo

26

137 2020 AI Alignment Literature Review and Charity Comparison

Larks

1y

14

69 Literature Review on Goal-Directedness

adamShimi

1y

21

13 Some recent survey papers on (mostly near-term) AI safety, security, and assurance

Aryeh Englander

1y

0