26 posts
Human Values
Shard Theory
Guilt & Shame
Internal Alignment (Human)
Scrupulosity
84 posts
Sex & Gender
Fun Theory
Complexity of Value
Coherent Extrapolated Volition
70 | Shard Theory in Nine Theses: a Distillation and Critical Appraisal | LawrenceC | 1d | 9
42 | Positive values seem more robust and lasting than prohibitions | TurnTrout | 3d | 9
202 | The shard theory of human values | Quintin Pope | 3mo | 57
175 | Humans provide an untapped wealth of evidence about alignment | TurnTrout | 5mo | 92
92 | Human values & biases are inaccessible to the genome | TurnTrout | 5mo | 51
20 | Signaling Guilt | Krieger | 2mo | 6
130 | Shard Theory: An Overview | David Udell | 4mo | 34
84 | Contra shard theory, in the context of the diamond maximizer problem | So8res | 2mo | 16
12 | Questions about Value Lock-in, Paternalism, and Empowerment | Sam | 1mo | 2
38 | Team Shard Status Report | David Udell | 4mo | 8
1 | What will happen when an all-reaching AGI starts attempting to fix human character flaws? | Michael Bright | 6mo | 6
5 | Value Notion - Questions to Ask | aysajan | 11mo | 0
34 | Not for the Sake of Selfishness Alone | lukeprog | 11y | 20
21 | Inner Goodness | Eliezer Yudkowsky | 14y | 31
31 | How to deal with someone in a LessWrong meeting being creepy | Douglas_Reay | 10y | 776
56 | Challenges to Yudkowsky's Pronoun Reform Proposal | Zack_M_Davis | 9mo | 55
48 | Prolegomena to a Theory of Fun | Eliezer Yudkowsky | 14y | 52
46 | General alignment properties | TurnTrout | 4mo | 2
65 | Anthropomorphic Optimism | Eliezer Yudkowsky | 14y | 59
51 | [NSFW Review] Interspecies Reviewers | lsusr | 8mo | 8
139 | Value is Fragile | Eliezer Yudkowsky | 13y | 111
138 | The Hidden Complexity of Wishes | Eliezer Yudkowsky | 15y | 136
56 | High Challenge | Eliezer Yudkowsky | 14y | 75
69 | The two-layer model of human values, and problems with synthesizing preferences | Kaj_Sotala | 2y | 16
22 | The Uses of Fun (Theory) | Eliezer Yudkowsky | 13y | 16
26 | Characterising utopia | Richard_Ngo | 2y | 3
29 | What's wrong with simplicity of value? | Wei_Dai | 11y | 40
25 | The Best Toy In The Park | jefftk | 2y | 15