Go Back
Choose this branch
Choose this branch
meritocratic
regular
democratic
hot
top
alive
94 posts
Rationality
Abstraction
Finite Factored Sets
Causality
Open Problems
Consequentialism
Filtered Evidence
Techniques
Consciousness
Intuition
Free Will
Adding Up to Normality
83 posts
Decision Theory
Goal-Directedness
Utility Functions
Literature Reviews
Quantilization
Mild Optimization
Coherence Arguments
Bounded Rationality
Orthogonality Thesis
Law-Thinking
Coherent Extrapolated Volition
Indexical Information
125
Finite Factored Sets in Pictures
Magdalena Wache
9d
29
78
Builder/Breaker for Deconfusion
abramdemski
2mo
9
133
Finite Factored Sets
Scott Garrabrant
1y
94
40
Counterfactability
Scott Garrabrant
1mo
4
122
Principles for Alignment/Agency Projects
johnswentworth
5mo
20
35
All the posts I will never write
Alexander Gietelink Oldenziel
4mo
8
74
[Intro to brain-like-AGI safety] 15. Conclusion: Open problems, how to help, AMA
Steven Byrnes
7mo
11
80
Testing The Natural Abstraction Hypothesis: Project Update
johnswentworth
1y
17
25
A critical agential account of free will, causation, and physics
jessicata
2y
10
38
Logical Representation of Causal Models
johnswentworth
2y
0
92
Learning Normativity: A Research Agenda
abramdemski
2y
18
22
Is my result wrong? Maths vs intuition vs evolution in learning human preferences
Stuart_Armstrong
3y
11
31
Knowledge, manipulation, and free will
Stuart_Armstrong
2y
15
78
Distributed Decisions
johnswentworth
6mo
4
93
wrapper-minds are the enemy
nostalgebraist
6mo
36
134
Can you control the past?
Joe Carlsmith
1y
93
59
Take 7: You should talk about "the human's utility function" less.
Charlie Steiner
12d
22
28
Quantilizers and Generative Models
Adam Jermyn
5mo
5
64
Notes on "Can you control the past"
So8res
2mo
40
10
Exploring Mild Behaviour in Embedded Agents
Megan Kinniment
5mo
3
78
Three ways that "Sufficiently optimized agents appear coherent" can be false
Wei_Dai
3y
3
66
Buridan's ass in coordination games
jessicata
4y
26
15
Modal Bargaining Agents
orthonormal
7y
0
32
Quantilizers maximize expected utility subject to a conservative cost constraint
jessicata
7y
0
26
Another view of quantilizers: avoiding Goodhart's Law
jessicata
6y
1
29
In memoryless Cartesian environments, every UDT policy is a CDT+SIA policy
jessicata
6y
5
2
Thoughts on Quantilizers
Stuart_Armstrong
5y
0
18
Quantilal control for finite MDPs
Vanessa Kosoy
4y
0