47 posts
Abstraction
Finite Factored Sets
Causality
Open Problems
Intuition
Consciousness
Free Will
Incentives
Adding Up to Normality
47 posts
Rationality
Filtered Evidence
Consequentialism
Techniques
Deontology
171
Finite Factored Sets in Pictures
Magdalena Wache
9d
29
141
Finite Factored Sets
Scott Garrabrant
1y
94
32
Counterfactability
Scott Garrabrant
1mo
4
88
[Intro to brain-like-AGI safety] 15. Conclusion: Open problems, how to help, AMA
Steven Byrnes
7mo
11
86
Testing The Natural Abstraction Hypothesis: Project Update
johnswentworth
1y
17
13
A critical agential account of free will, causation, and physics
jessicata
2y
10
26
Logical Representation of Causal Models
johnswentworth
2y
0
12
Is my result wrong? Maths vs intuition vs evolution in learning human preferences
Stuart_Armstrong
3y
11
63
Pointing to a Flower
johnswentworth
2y
18
11
Integrating Hidden Variables Improves Approximation
johnswentworth
2y
4
71
Open Problems in AI X-Risk [PAIS #5]
Dan H
6mo
3
26
[AN #163]: Using finite factored sets for causal and temporal inference
Rohin Shah
1y
0
24
The Indexing Problem
johnswentworth
2y
2
7
What is the subjective experience of free will for agents?
Gordon Seidoh Worley
2y
19
62
Builder/Breaker for Deconfusion
abramdemski
2mo
9
108
Principles for Alignment/Agency Projects
johnswentworth
5mo
20
69
All the posts I will never write
Alexander Gietelink Oldenziel
4mo
8
60
Learning Normativity: A Research Agenda
abramdemski
2y
18
33
Knowledge, manipulation, and free will
Stuart_Armstrong
2y
15
52
Distributed Decisions
johnswentworth
6mo
4
18
Non-poisonous cake: anthropic updates are normal
Stuart_Armstrong
1y
11
37
Do Sufficiently Advanced Agents Use Logic?
abramdemski
3y
11
31
Values Form a Shifting Landscape (and why you might care)
VojtaKovarik
2y
6
11
Learning Normativity: Language
Bunthut
1y
4
24
Learning human preferences: optimistic and pessimistic scenarios
Stuart_Armstrong
2y
6
23
Hiding Complexity
Rafael Harth
2y
14
14
Egan's Theorem?
johnswentworth
2y
12
8
What is a VNM stable set, really?
Nisan
1y
0