Go Back
You can't go any further
You can't go any further
meritocratic
regular
democratic
hot
top
alive
4 posts
The Pointers Problem
59 posts
Value Learning
104
The Pointers Problem: Human Values Are A Function Of Humans' Latent Variables
johnswentworth
2y
43
19
Stable Pointers to Value III: Recursive Quantilization
abramdemski
4y
4
18
Stable Pointers to Value II: Environmental Goals
abramdemski
4y
2
15
Stable Pointers to Value: An Agent Embedded in Its Own Utility Function
abramdemski
5y
9
76
The Urgent Meta-Ethics of Friendly Artificial Intelligence
lukeprog
11y
252
69
The E-Coli Test for AI Alignment
johnswentworth
4y
24
68
Preface to the sequence on value learning
Rohin Shah
4y
6
65
Why we need a *theory* of human values
Stuart_Armstrong
4y
15
64
Clarifying "AI Alignment"
paulfchristiano
4y
82
58
Where do selfish values come from?
Wei_Dai
11y
62
56
Humans can be assigned any values whatsoever…
Stuart_Armstrong
4y
26
52
Intuitions about goal-directed behavior
Rohin Shah
4y
15
50
The easy goal inference problem is still hard
paulfchristiano
4y
19
49
What is ambitious value learning?
Rohin Shah
4y
28
49
Conclusion to the sequence on value learning
Rohin Shah
3y
20
46
Future directions for ambitious value learning
Rohin Shah
4y
9
42
Different perspectives on concept extrapolation
Stuart_Armstrong
8mo
7
41
Using vector fields to visualise preferences and make them consistent
MichaelA
2y
32