Tree of Tags

Go Back

Choose this branch

You can't go any further

meritocratic regular democratic

hot top alive

37 posts Value Learning Kolmogorov Complexity

5 posts The Pointers Problem

40 Beyond Kolmogorov and Shannon

Alexander Gietelink Oldenziel

1mo

14

50 Different perspectives on concept extrapolation

Stuart_Armstrong

8mo

7

66 Parsing Chris Mingard on Neural Networks

Alex Flint

1y

27

30 How an alien theory of mind might be unlearnable

Stuart_Armstrong

11mo

35

20 Value extrapolation, concept extrapolation, model splintering

Stuart_Armstrong

9mo

1

54 Normativity

abramdemski

2y

11

21 Morally underdefined situations can be deadly

Stuart_Armstrong

1y

8

10 AIs should learn human preferences, not biases

Stuart_Armstrong

8mo

1

71 Clarifying "AI Alignment"

paulfchristiano

4y

82

62 Preface to the sequence on value learning

Rohin Shah

4y

6

39 Other versions of "No free lunch in value learning"

Stuart_Armstrong

2y

0

56 Conclusion to the sequence on value learning

Rohin Shah

3y

20

37 Using vector fields to visualise preferences and make them consistent

MichaelA

2y

32

28 Learning human preferences: black-box, white-box, and structured white-box access

Stuart_Armstrong

2y

9

53 Don't design agents which exploit adversarial inputs

TurnTrout

1mo

61

39 People care about each other even though they have imperfect motivational pointers?

TurnTrout

1mo

25

99 The Pointers Problem: Human Values Are A Function Of Humans' Latent Variables

johnswentworth

2y

43

23 Stable Pointers to Value II: Environmental Goals

abramdemski

4y

2

20 Stable Pointers to Value: An Agent Embedded in Its Own Utility Function

abramdemski

5y

9