Tree of Tags

Go Back

You can't go any further

You can't go any further

meritocratic regular democratic

hot top alive

16 posts Goal-Directedness

4 posts Literature Reviews

119 why assume AGIs will optimize for fixed goals?

nostalgebraist

6mo

52

92 wrapper-minds are the enemy

nostalgebraist

6mo

36

65 AI safety without goal-directed behavior

Rohin Shah

3y

15

55 Finding Goals in the World Model

Jeremy Gillen

4mo

8

52 When Most VNM-Coherent Preference Orderings Have Convergent Instrumental Incentives

TurnTrout

1y

4

51 Will humans build goal-directed agents?

Rohin Shah

3y

43

33 P₂B: Plan to P₂B Better

Ramana Kumar

1y

14

21 Goal-directed = Model-based RL?

adamShimi

2y

10

21 Behavioral Sufficient Statistics for Goal-Directedness

adamShimi

1y

12

19 Focus: you are allowed to be bad at accomplishing your goals

adamShimi

2y

17

19 Against the Backward Approach to Goal-Directedness

adamShimi

1y

6

16 Locality of goals

adamShimi

2y

8

14 Goal-Directedness and Behavior, Redux

adamShimi

1y

4

14 Goals and short descriptions

Michele Campolo

2y

8

164 2021 AI Alignment Literature Review and Charity Comparison

Larks

12mo

26

137 2020 AI Alignment Literature Review and Charity Comparison

Larks

1y

14

69 Literature Review on Goal-Directedness

adamShimi

1y

21

11 Some recent survey papers on (mostly near-term) AI safety, security, and assurance

Aryeh Englander

1y

0