Tree of Tags

Go Back

Choose this branch

Choose this branch

meritocratic regular democratic

hot top alive

63 posts Decision Theory Utility Functions Quantilization Mild Optimization Bounded Rationality Coherence Arguments Law-Thinking Orthogonality Thesis Coherent Extrapolated Volition Indexical Information

20 posts Goal-Directedness Literature Reviews

35 Take 7: You should talk about "the human's utility function" less.

Charlie Steiner

12d

22

46 Notes on "Can you control the past"

So8res

2mo

40

160 Can you control the past?

Joe Carlsmith

1y

93

56 The "Measuring Stick of Utility" Problem

johnswentworth

6mo

22

32 Exploring Mild Behaviour in Embedded Agents

Megan Kinniment

5mo

3

20 Quantilizers and Generative Models

Adam Jermyn

5mo

5

130 An Orthodox Case Against Utility Functions

abramdemski

2y

53

74 Coherence arguments imply a force for goal-directed behavior

KatjaGrace

1y

27

206 Realism about rationality

Richard_Ngo

4y

145

28 Inferring utility functions from locally non-transitive preferences

Jan

10mo

15

133 Decision Theory

abramdemski

4y

46

99 Utility ≠ Reward

vlad_m

3y

25

96 A Critique of Functional Decision Theory

wdmacaskill

3y

54

90 Comparison of decision theories (with a focus on logical-counterfactual decision theories)

riceissa

3y

11

146 why assume AGIs will optimize for fixed goals?

nostalgebraist

6mo

52

60 Finding Goals in the World Model

Jeremy Gillen

4mo

8

91 wrapper-minds are the enemy

nostalgebraist

6mo

36

175 2021 AI Alignment Literature Review and Charity Comparison

Larks

12mo

26

137 2020 AI Alignment Literature Review and Charity Comparison

Larks

1y

14

69 Literature Review on Goal-Directedness

adamShimi

1y

21

35 When Most VNM-Coherent Preference Orderings Have Convergent Instrumental Incentives

TurnTrout

1y

4

23 P₂B: Plan to P₂B Better

Ramana Kumar

1y

14

59 AI safety without goal-directed behavior

Rohin Shah

3y

15

45 Will humans build goal-directed agents?

Rohin Shah

3y

43

11 Goal-Directedness and Behavior, Redux

adamShimi

1y

4

13 Behavioral Sufficient Statistics for Goal-Directedness

adamShimi

1y

12

21 Goal-directed = Model-based RL?

adamShimi

2y

10

13 Against the Backward Approach to Goal-Directedness

adamShimi

1y

6