Go Back
Choose this branch
Choose this branch
meritocratic
regular
democratic
hot
top
alive
26 posts
Human Values
Shard Theory
Guilt & Shame
Internal Alignment (Human)
Scrupulosity
84 posts
Sex & Gender
Fun Theory
Complexity of Value
Coherent Extrapolated Volition
202
The shard theory of human values
Quintin Pope
3mo
57
175
Humans provide an untapped wealth of evidence about alignment
TurnTrout
5mo
92
130
Shard Theory: An Overview
David Udell
4mo
34
92
Human values & biases are inaccessible to the genome
TurnTrout
5mo
51
84
Contra shard theory, in the context of the diamond maximizer problem
So8res
2mo
16
70
Shard Theory in Nine Theses: a Distillation and Critical Appraisal
LawrenceC
1d
9
50
What AI Safety Researchers Have Written About the Nature of Human Values
avturchin
3y
3
46
Normativity
abramdemski
2y
11
42
Positive values seem more robust and lasting than prohibitions
TurnTrout
3d
9
39
The Parable Of The Talents
Scott Alexander
7y
10
38
Team Shard Status Report
David Udell
4mo
8
34
Not for the Sake of Selfishness Alone
lukeprog
11y
20
33
Nate Soares' Replacing Guilt Series compiled in epub Format
lifelonglearner
5y
9
27
Human values differ as much as values can differ
PhilGoetz
12y
220
139
Value is Fragile
Eliezer Yudkowsky
13y
111
138
The Hidden Complexity of Wishes
Eliezer Yudkowsky
15y
136
132
Joy in the Merely Real
Eliezer Yudkowsky
14y
43
95
The Gift We Give To Tomorrow
Eliezer Yudkowsky
14y
99
85
Interpersonal Entanglement
Eliezer Yudkowsky
13y
167
73
31 Laws of Fun
Eliezer Yudkowsky
13y
36
73
But exactly how complex and fragile?
KatjaGrace
3y
32
69
The two-layer model of human values, and problems with synthesizing preferences
Kaj_Sotala
2y
16
69
The Fun Theory Sequence
Eliezer Yudkowsky
13y
30
69
Sayeth the Girl
Alicorn
13y
503
65
Anthropomorphic Optimism
Eliezer Yudkowsky
14y
59
62
A Rationalist's Account of Objectification?
lukeprog
11y
327
61
The Skewed and the Screwed: When Mating Meets Politics
Jacob Falkovich
2y
6
60
Complexity of Value ≠ Complexity of Outcome
Wei_Dai
12y
232