26 posts
Human Values
Shard Theory
Guilt & Shame
Internal Alignment (Human)
Scrupulosity
84 posts
Sex & Gender
Fun Theory
Complexity of Value
Coherent Extrapolated Volition
75
Shard Theory in Nine Theses: a Distillation and Critical Appraisal
LawrenceC
1d
9
44
Positive values seem more robust and lasting than prohibitions
TurnTrout
3d
9
92
Contra shard theory, in the context of the diamond maximizer problem
So8res
2mo
16
141
The shard theory of human values
Quintin Pope
3mo
57
133
Shard Theory: An Overview
David Udell
4mo
34
148
Humans provide an untapped wealth of evidence about alignment
TurnTrout
5mo
92
88
Human values & biases are inaccessible to the genome
TurnTrout
5mo
51
9
Questions about Value Lock-in, Paternalism, and Empowerment
Sam
1mo
2
32
Team Shard Status Report
David Udell
4mo
8
14
Signaling Guilt
Krieger
2mo
6
18
Silliness
lsusr
6mo
0
52
Normativity
abramdemski
2y
11
42
What AI Safety Researchers Have Written About the Nature of Human Values
avturchin
3y
3
26
Preference synthesis illustrated: Star Wars
Stuart_Armstrong
2y
8
49
General alignment properties
TurnTrout
4mo
2
66
Challenges to Yudkowsky's Pronoun Reform Proposal
Zack_M_Davis
9mo
55
53
[NSFW Review] Interspecies Reviewers
lsusr
8mo
8
71
Review of 'But exactly how complex and fragile?'
TurnTrout
1y
0
9
Playing Without Affordances
Alex Hollow
4mo
0
86
The two-layer model of human values, and problems with synthesizing preferences
Kaj_Sotala
2y
16
44
Sexual Dimorphism in Yudkowsky's Sequences, in Relation to My Gender Problems
Zack_M_Davis
1y
29
83
But exactly how complex and fragile?
KatjaGrace
3y
32
58
The Skewed and the Screwed: When Mating Meets Politics
Jacob Falkovich
2y
6
58
Masculine Virtues
Jacob Falkovich
3y
32
30
The Best Toy In The Park
jefftk
2y
15
29
Characterising utopia
Richard_Ngo
2y
3
26
Reversible changes: consider a bucket of water
Stuart_Armstrong
3y
18
31
Values Weren't Complex, Once.
Davidmanheim
4y
13