37 posts
Value Learning
Kolmogorov Complexity
5 posts
The Pointers Problem
71
Clarifying "AI Alignment"
paulfchristiano
4y
82
66
Parsing Chris Mingard on Neural Networks
Alex Flint
1y
27
62
Preface to the sequence on value learning
Rohin Shah
4y
6
56
Conclusion to the sequence on value learning
Rohin Shah
3y
20
54
What is ambitious value learning?
Rohin Shah
4y
28
54
Humans can be assigned any values whatsoever…
Stuart_Armstrong
4y
26
54
Policy Alignment
abramdemski
4y
25
54
Normativity
abramdemski
2y
11
53
Intuitions about goal-directed behavior
Rohin Shah
4y
15
52
The easy goal inference problem is still hard
paulfchristiano
4y
19
50
Different perspectives on concept extrapolation
Stuart_Armstrong
8mo
7
45
Future directions for ambitious value learning
Rohin Shah
4y
9
40
Beyond Kolmogorov and Shannon
Alexander Gietelink Oldenziel
1mo
14
39
Other versions of "No free lunch in value learning"
Stuart_Armstrong
2y
0
99
The Pointers Problem: Human Values Are A Function Of Humans' Latent Variables
johnswentworth
2y
43
53
Don't design agents which exploit adversarial inputs
TurnTrout
1mo
61
39
People care about each other even though they have imperfect motivational pointers?
TurnTrout
1mo
25
23
Stable Pointers to Value II: Environmental Goals
abramdemski
4y
2
20
Stable Pointers to Value: An Agent Embedded in Its Own Utility Function
abramdemski
5y
9