Go Back
Choose this branch
Choose this branch
meritocratic
regular
democratic
hot
top
alive
26 posts
Human Values
Shard Theory
Guilt & Shame
Internal Alignment (Human)
Scrupulosity
84 posts
Sex & Gender
Fun Theory
Complexity of Value
Coherent Extrapolated Volition
65
Shard Theory in Nine Theses: a Distillation and Critical Appraisal
LawrenceC
1d
9
40
Positive values seem more robust and lasting than prohibitions
TurnTrout
3d
9
263
The shard theory of human values
Quintin Pope
3mo
57
202
Humans provide an untapped wealth of evidence about alignment
TurnTrout
5mo
92
96
Human values & biases are inaccessible to the genome
TurnTrout
5mo
51
26
Signaling Guilt
Krieger
2mo
6
127
Shard Theory: An Overview
David Udell
4mo
34
76
Contra shard theory, in the context of the diamond maximizer problem
So8res
2mo
16
15
Questions about Value Lock-in, Paternalism, and Empowerment
Sam
1mo
2
44
Team Shard Status Report
David Udell
4mo
8
5
What will happen when an all-reaching AGI starts attempting to fix human character flaws?
Michael Bright
6mo
6
6
Value Notion - Questions to Ask
aysajan
11mo
0
23
Not for the Sake of Selfishness Alone
lukeprog
11y
20
17
Inner Goodness
Eliezer Yudkowsky
14y
31
22
How to deal with someone in a LessWrong meeting being creepy
Douglas_Reay
10y
776
46
Challenges to Yudkowsky's Pronoun Reform Proposal
Zack_M_Davis
9mo
55
52
Prolegomena to a Theory of Fun
Eliezer Yudkowsky
14y
52
43
General alignment properties
TurnTrout
4mo
2
82
Anthropomorphic Optimism
Eliezer Yudkowsky
14y
59
49
[NSFW Review] Interspecies Reviewers
lsusr
8mo
8
211
Value is Fragile
Eliezer Yudkowsky
13y
111
179
The Hidden Complexity of Wishes
Eliezer Yudkowsky
15y
136
69
High Challenge
Eliezer Yudkowsky
14y
75
52
The two-layer model of human values, and problems with synthesizing preferences
Kaj_Sotala
2y
16
19
The Uses of Fun (Theory)
Eliezer Yudkowsky
13y
16
23
Characterising utopia
Richard_Ngo
2y
3
21
What's wrong with simplicity of value?
Wei_Dai
11y
40
20
The Best Toy In The Park
jefftk
2y
15