Go Back
You can't go any further
Choose this branch
meritocratic
regular
democratic
hot
top
alive
15 posts
Human Values
11 posts
Shard Theory
Guilt & Shame
Internal Alignment (Human)
Scrupulosity
175
Humans provide an untapped wealth of evidence about alignment
TurnTrout
5mo
92
92
Human values & biases are inaccessible to the genome
TurnTrout
5mo
51
12
Questions about Value Lock-in, Paternalism, and Empowerment
Sam
1mo
2
1
What will happen when an all-reaching AGI starts attempting to fix human character flaws?
Michael Bright
6mo
6
5
Value Notion - Questions to Ask
aysajan
11mo
0
34
Not for the Sake of Selfishness Alone
lukeprog
11y
20
21
Inner Goodness
Eliezer Yudkowsky
14y
31
50
What AI Safety Researchers Have Written About the Nature of Human Values
avturchin
3y
3
13
Ends: An Introduction
Rob Bensinger
7y
0
19
Preference synthesis illustrated: Star Wars
Stuart_Armstrong
2y
8
27
Human values differ as much as values can differ
PhilGoetz
12y
220
10
Modeling humans: what's the point?
Charlie Steiner
2y
1
27
Thought experiment: coarse-grained VR utopia
cousin_it
5y
48
18
Silliness
lsusr
6mo
0
70
Shard Theory in Nine Theses: a Distillation and Critical Appraisal
LawrenceC
1d
9
42
Positive values seem more robust and lasting than prohibitions
TurnTrout
3d
9
202
The shard theory of human values
Quintin Pope
3mo
57
20
Signaling Guilt
Krieger
2mo
6
130
Shard Theory: An Overview
David Udell
4mo
34
84
Contra shard theory, in the context of the diamond maximizer problem
So8res
2mo
16
38
Team Shard Status Report
David Udell
4mo
8
33
Nate Soares' Replacing Guilt Series compiled in epub Format
lifelonglearner
5y
9
12
Typical Minding Guilt/Shame
Unreal
5y
2
-5
Rhythm 0 and the Absolution of Responsibility
Precious Oluwatobi Emmanuel
1y
0
39
The Parable Of The Talents
Scott Alexander
7y
10