Go Back
Choose this branch
Choose this branch
meritocratic
regular
democratic
hot
top
alive
41 posts
Impact Regularization
Exercises / Problem-Sets
Fixed Point Theorems
84 posts
World Modeling
Anthropics
AIXI
Updateless Decision Theory
Sleeping Beauty Paradox
Cognitive Science
Extraterrestrial Life
Economics
Grabby Aliens
Infinity
65
Attainable Utility Preservation: Empirical Results
TurnTrout
2y
8
20
Using modal fixed points to formalize logical causality
cousin_it
5y
0
37
Attainable Utility Preservation: Concepts
TurnTrout
2y
20
30
Iteration Fixed Point Exercises
Scott Garrabrant
4y
12
71
Attainable Utility Theory: Why Things Matter
TurnTrout
3y
24
33
Overcoming Clinginess in Impact Measures
TurnTrout
4y
9
34
Diagonalization Fixed Point Exercises
Scott Garrabrant
4y
23
57
Worrying about the Vase: Whitelisting
TurnTrout
4y
26
21
[AN #68]: The attainable utility theory of impact
Rohin Shah
3y
0
36
When wishful thinking works
AlexMennen
4y
1
30
How Low Should Fruit Hang Before We Pick It?
TurnTrout
2y
9
17
Appendix: mathematics of indexical impact measures
Stuart_Armstrong
2y
0
77
Deducing Impact
TurnTrout
3y
26
26
Attainable Utility Preservation: Scaling to Superhuman
TurnTrout
2y
21
48
Open technical problem: A Quinean proof of Löb's theorem, for an easier cartoon guide
Andrew_Critch
26d
34
28
Traps of Formalization in Deconfusion
adamShimi
1y
7
75
«Boundaries», Part 3a: Defining boundaries as directed Markov blankets
Andrew_Critch
1mo
13
239
Without specific countermeasures, the easiest path to transformative AI likely leads to AI takeover
Ajeya Cotra
5mo
89
67
Humans do acausal coordination all the time
Adam Jermyn
1mo
36
84
Less Threat-Dependent Bargaining Solutions?? (3/2)
Diffractor
4mo
7
159
Testing The Natural Abstraction Hypothesis: Project Intro
johnswentworth
1y
34
573
Where I agree and disagree with Eliezer
paulfchristiano
6mo
205
17
Deliberation Everywhere: Simple Examples
Oliver Sourbut
5mo
0
11
An implementation of modal UDT
Benya_Fallenstein
7y
0
0
Corrigibility for AIXI via double indifference
Stuart_Armstrong
6y
0
13
Updatelessness and Son of X
Scott Garrabrant
6y
0
1
The Doomsday argument in anthropic decision theory
Stuart_Armstrong
5y
0
18
UDT as a Nash Equilibrium
cousin_it
4y
17