Tree of Tags

37 posts Value Learning Kolmogorov Complexity

5 posts The Pointers Problem

80 Beyond Kolmogorov and Shannon

Alexander Gietelink Oldenziel

1mo

14

74 Preface to the sequence on value learning

Rohin Shah

4y

6

68 Parsing Chris Mingard on Neural Networks

Alex Flint

1y

27

58 Humans can be assigned any values whatsoever…

Stuart_Armstrong

4y

26

57 Clarifying "AI Alignment"

paulfchristiano

4y

82

51 Intuitions about goal-directed behavior

Rohin Shah

4y

15

48 The easy goal inference problem is still hard

paulfchristiano

4y

19

47 Future directions for ambitious value learning

Rohin Shah

4y

9

46 Policy Alignment

abramdemski

4y

25

45 Using vector fields to visualise preferences and make them consistent

MichaelA

2y

32

44 What is ambitious value learning?

Rohin Shah

4y

28

42 Conclusion to the sequence on value learning

Rohin Shah

3y

20

38 Normativity

abramdemski

2y

11

34 Different perspectives on concept extrapolation

Stuart_Armstrong

8mo

7

109 The Pointers Problem: Human Values Are A Function Of Humans' Latent Variables

johnswentworth

2y

43

67 Don't design agents which exploit adversarial inputs

TurnTrout

1mo

61

25 People care about each other even though they have imperfect motivational pointers?

TurnTrout

1mo

25

13 Stable Pointers to Value II: Environmental Goals

abramdemski

4y

2

10 Stable Pointers to Value: An Agent Embedded in Its Own Utility Function

abramdemski

5y

9