Go Back
Choose this branch
Choose this branch
meritocratic
regular
democratic
hot
top
alive
94 posts
Rationality
Abstraction
Finite Factored Sets
Causality
Open Problems
Consequentialism
Filtered Evidence
Techniques
Consciousness
Intuition
Free Will
Adding Up to Normality
83 posts
Decision Theory
Goal-Directedness
Utility Functions
Literature Reviews
Quantilization
Mild Optimization
Coherence Arguments
Bounded Rationality
Orthogonality Thesis
Law-Thinking
Coherent Extrapolated Volition
Indexical Information
171
Finite Factored Sets in Pictures
Magdalena Wache
9d
29
32
Counterfactability
Scott Garrabrant
1mo
4
62
Builder/Breaker for Deconfusion
abramdemski
2mo
9
108
Principles for Alignment/Agency Projects
johnswentworth
5mo
20
69
All the posts I will never write
Alexander Gietelink Oldenziel
4mo
8
88
[Intro to brain-like-AGI safety] 15. Conclusion: Open problems, how to help, AMA
Steven Byrnes
7mo
11
137
What's Up With Confusingly Pervasive Consequentialism?
Raemon
11mo
88
71
Open Problems in AI X-Risk [PAIS #5]
Dan H
6mo
3
52
Distributed Decisions
johnswentworth
6mo
4
141
Finite Factored Sets
Scott Garrabrant
1y
94
114
Saving Time
Scott Garrabrant
1y
19
86
Testing The Natural Abstraction Hypothesis: Project Update
johnswentworth
1y
17
34
Exploring Finite Factored Sets with some toy examples
Thomas Kehrenberg
9mo
1
29
[ASoT] Searching for consequentialist structure
leogao
8mo
2
35
Take 7: You should talk about "the human's utility function" less.
Charlie Steiner
12d
22
46
Notes on "Can you control the past"
So8res
2mo
40
146
why assume AGIs will optimize for fixed goals?
nostalgebraist
6mo
52
60
Finding Goals in the World Model
Jeremy Gillen
4mo
8
91
wrapper-minds are the enemy
nostalgebraist
6mo
36
175
2021 AI Alignment Literature Review and Charity Comparison
Larks
12mo
26
160
Can you control the past?
Joe Carlsmith
1y
93
56
The "Measuring Stick of Utility" Problem
johnswentworth
6mo
22
32
Exploring Mild Behaviour in Embedded Agents
Megan Kinniment
5mo
3
137
2020 AI Alignment Literature Review and Charity Comparison
Larks
1y
14
20
Quantilizers and Generative Models
Adam Jermyn
5mo
5
130
An Orthodox Case Against Utility Functions
abramdemski
2y
53
74
Coherence arguments imply a force for goal-directed behavior
KatjaGrace
1y
27
206
Realism about rationality
Richard_Ngo
4y
145