Go Back
Choose this branch
You can't go any further
meritocratic
regular
democratic
hot
top
alive
37 posts
Value Learning
Kolmogorov Complexity
5 posts
The Pointers Problem
80
Beyond Kolmogorov and Shannon
Alexander Gietelink Oldenziel
1mo
14
74
Preface to the sequence on value learning
Rohin Shah
4y
6
68
Parsing Chris Mingard on Neural Networks
Alex Flint
1y
27
58
Humans can be assigned any values whatsoever…
Stuart_Armstrong
4y
26
57
Clarifying "AI Alignment"
paulfchristiano
4y
82
51
Intuitions about goal-directed behavior
Rohin Shah
4y
15
48
The easy goal inference problem is still hard
paulfchristiano
4y
19
47
Future directions for ambitious value learning
Rohin Shah
4y
9
46
Policy Alignment
abramdemski
4y
25
45
Using vector fields to visualise preferences and make them consistent
MichaelA
2y
32
44
What is ambitious value learning?
Rohin Shah
4y
28
42
Conclusion to the sequence on value learning
Rohin Shah
3y
20
38
Normativity
abramdemski
2y
11
34
Different perspectives on concept extrapolation
Stuart_Armstrong
8mo
7
109
The Pointers Problem: Human Values Are A Function Of Humans' Latent Variables
johnswentworth
2y
43
67
Don't design agents which exploit adversarial inputs
TurnTrout
1mo
61
25
People care about each other even though they have imperfect motivational pointers?
TurnTrout
1mo
25
13
Stable Pointers to Value II: Environmental Goals
abramdemski
4y
2
10
Stable Pointers to Value: An Agent Embedded in Its Own Utility Function
abramdemski
5y
9