26 posts
Human Values
Shard Theory
Guilt & Shame
Internal Alignment (Human)
Scrupulosity
84 posts
Sex & Gender
Fun Theory
Complexity of Value
Coherent Extrapolated Volition
75
Shard Theory in Nine Theses: a Distillation and Critical Appraisal
LawrenceC
1d
9
44
Positive values seem more robust and lasting than prohibitions
TurnTrout
3d
9
141
The shard theory of human values
Quintin Pope
3mo
57
148
Humans provide an untapped wealth of evidence about alignment
TurnTrout
5mo
92
88
Human values & biases are inaccessible to the genome
TurnTrout
5mo
51
14
Signaling Guilt
Krieger
2mo
6
133
Shard Theory: An Overview
David Udell
4mo
34
92
Contra shard theory, in the context of the diamond maximizer problem
So8res
2mo
16
9
Questions about Value Lock-in, Paternalism, and Empowerment
Sam
1mo
2
32
Team Shard Status Report
David Udell
4mo
8
-3
What will happen when an all-reaching AGI starts attempting to fix human character flaws?
Michael Bright
6mo
6
4
Value Notion - Questions to Ask
aysajan
11mo
0
45
Not for the Sake of Selfishness Alone
lukeprog
11y
20
25
Inner Goodness
Eliezer Yudkowsky
14y
31
40
How to deal with someone in a LessWrong meeting being creepy
Douglas_Reay
10y
776
66
Challenges to Yudkowsky's Pronoun Reform Proposal
Zack_M_Davis
9mo
55
44
Prolegomena to a Theory of Fun
Eliezer Yudkowsky
14y
52
49
General alignment properties
TurnTrout
4mo
2
48
Anthropomorphic Optimism
Eliezer Yudkowsky
14y
59
53
[NSFW Review] Interspecies Reviewers
lsusr
8mo
8
67
Value is Fragile
Eliezer Yudkowsky
13y
111
97
The Hidden Complexity of Wishes
Eliezer Yudkowsky
15y
136
43
High Challenge
Eliezer Yudkowsky
14y
75
86
The two-layer model of human values, and problems with synthesizing preferences
Kaj_Sotala
2y
16
25
The Uses of Fun (Theory)
Eliezer Yudkowsky
13y
16
29
Characterising utopia
Richard_Ngo
2y
3
37
What's wrong with simplicity of value?
Wei_Dai
11y
40
30
The Best Toy In The Park
jefftk
2y
15