37 posts
Value Learning
Kolmogorov Complexity
5 posts
The Pointers Problem
40
Beyond Kolmogorov and Shannon
Alexander Gietelink Oldenziel
1mo
14
50
Different perspectives on concept extrapolation
Stuart_Armstrong
8mo
7
66
Parsing Chris Mingard on Neural Networks
Alex Flint
1y
27
30
How an alien theory of mind might be unlearnable
Stuart_Armstrong
11mo
35
20
Value extrapolation, concept extrapolation, model splintering
Stuart_Armstrong
9mo
1
54
Normativity
abramdemski
2y
11
21
Morally underdefined situations can be deadly
Stuart_Armstrong
1y
8
10
AIs should learn human preferences, not biases
Stuart_Armstrong
8mo
1
71
Clarifying "AI Alignment"
paulfchristiano
4y
82
62
Preface to the sequence on value learning
Rohin Shah
4y
6
39
Other versions of "No free lunch in value learning"
Stuart_Armstrong
2y
0
56
Conclusion to the sequence on value learning
Rohin Shah
3y
20
37
Using vector fields to visualise preferences and make them consistent
MichaelA
2y
32
28
Learning human preferences: black-box, white-box, and structured white-box access
Stuart_Armstrong
2y
9
53
Don't design agents which exploit adversarial inputs
TurnTrout
1mo
61
39
People care about each other even though they have imperfect motivational pointers?
TurnTrout
1mo
25
99
The Pointers Problem: Human Values Are A Function Of Humans' Latent Variables
johnswentworth
2y
43
23
Stable Pointers to Value II: Environmental Goals
abramdemski
4y
2
20
Stable Pointers to Value: An Agent Embedded in Its Own Utility Function
abramdemski
5y
9