94 posts
Rationality
Abstraction
Finite Factored Sets
Causality
Open Problems
Consequentialism
Filtered Evidence
Techniques
Consciousness
Intuition
Free Will
Adding Up to Normality
83 posts
Decision Theory
Goal-Directedness
Utility Functions
Literature Reviews
Quantilization
Mild Optimization
Coherence Arguments
Bounded Rationality
Orthogonality Thesis
Law-Thinking
Coherent Extrapolated Volition
Indexical Information
169
What's Up With Confusingly Pervasive Consequentialism?
Raemon
11mo
88
148
Finite Factored Sets in Pictures
Magdalena Wache
9d
29
137
Finite Factored Sets
Scott Garrabrant
1y
94
130
Saving Time
Scott Garrabrant
1y
19
115
Principles for Alignment/Agency Projects
johnswentworth
5mo
20
113
Problem relaxation as a tactic
TurnTrout
2y
8
85
Thinking About Filtered Evidence Is (Very!) Hard
abramdemski
2y
29
83
Testing The Natural Abstraction Hypothesis: Project Update
johnswentworth
1y
17
82
Public Static: What is Abstraction?
johnswentworth
2y
18
81
[Intro to brain-like-AGI safety] 15. Conclusion: Open problems, how to help, AMA
Steven Byrnes
7mo
11
76
Learning Normativity: A Research Agenda
abramdemski
2y
18
76
Writing Causal Models Like We Write Programs
johnswentworth
2y
8
70
Search-in-Territory vs Search-in-Map
johnswentworth
1y
13
70
Builder/Breaker for Deconfusion
abramdemski
2mo
9
180
Realism about rationality
Richard_Ngo
4y
145
164
2021 AI Alignment Literature Review and Charity Comparison
Larks
12mo
26
147
Can you control the past?
Joe Carlsmith
1y
93
137
2020 AI Alignment Literature Review and Charity Comparison
Larks
1y
14
128
An Orthodox Case Against Utility Functions
abramdemski
2y
53
119
why assume AGIs will optimize for fixed goals?
nostalgebraist
6mo
52
114
Decision Theory
abramdemski
4y
46
102
Utility ≠ Reward
vlad_m
3y
25
101
Coherence arguments do not entail goal-directed behavior
Rohin Shah
4y
69
92
wrapper-minds are the enemy
nostalgebraist
6mo
36
88
Coherence arguments imply a force for goal-directed behavior
KatjaGrace
1y
27
81
A Critique of Functional Decision Theory
wdmacaskill
3y
54
77
Troll Bridge
abramdemski
3y
59
69
The "Measuring Stick of Utility" Problem
johnswentworth
6mo
22