4 posts
The Pointers Problem
59 posts
Value Learning
115
The Pointers Problem: Human Values Are A Function Of Humans' Latent Variables
johnswentworth
2y
43
19
Stable Pointers to Value III: Recursive Quantilization
abramdemski
4y
4
13
Stable Pointers to Value II: Environmental Goals
abramdemski
4y
2
10
Stable Pointers to Value: An Agent Embedded in Its Own Utility Function
abramdemski
5y
9
23
Character alignment
p.b.
3mo
0
36
Different perspectives on concept extrapolation
Stuart_Armstrong
8mo
7
25
An Open Philanthropy grant proposal: Causal representation learning of human preferences
PabloAMC
11mo
6
23
How an alien theory of mind might be unlearnable
Stuart_Armstrong
11mo
35
11
The Pointers Problem - Distilled
NinaR
6mo
0
9
Value extrapolation vs Wireheading
Stuart_Armstrong
6mo
1
13
Natural Value Learning
Chris van Merwijk
9mo
10
11
AIs should learn human preferences, not biases
Stuart_Armstrong
8mo
1
80
The E-Coli Test for AI Alignment
johnswentworth
4y
24
77
Preface to the sequence on value learning
Rohin Shah
4y
6
47
Using vector fields to visualise preferences and make them consistent
MichaelA
2y
32
66
Why we need a *theory* of human values
Stuart_Armstrong
4y
15
14
Morally underdefined situations can be deadly
Stuart_Armstrong
1y
8
61
Humans can be assigned any values whatsoever…
Stuart_Armstrong
4y
26