Go Back
Choose this branch
Choose this branch
meritocratic
regular
democratic
hot
top
alive
47 posts
Abstraction
Finite Factored Sets
Causality
Open Problems
Intuition
Consciousness
Free Will
Incentives
Adding Up to Normality
47 posts
Rationality
Filtered Evidence
Consequentialism
Techniques
Deontology
148
Finite Factored Sets in Pictures
Magdalena Wache
9d
29
137
Finite Factored Sets
Scott Garrabrant
1y
94
36
Counterfactability
Scott Garrabrant
1mo
4
81
[Intro to brain-like-AGI safety] 15. Conclusion: Open problems, how to help, AMA
Steven Byrnes
7mo
11
83
Testing The Natural Abstraction Hypothesis: Project Update
johnswentworth
1y
17
19
A critical agential account of free will, causation, and physics
jessicata
2y
10
32
Logical Representation of Causal Models
johnswentworth
2y
0
17
Is my result wrong? Maths vs intuition vs evolution in learning human preferences
Stuart_Armstrong
3y
11
59
Pointing to a Flower
johnswentworth
2y
18
15
Integrating Hidden Variables Improves Approximation
johnswentworth
2y
4
50
Open Problems in AI X-Risk [PAIS #5]
Dan H
6mo
3
38
[AN #163]: Using finite factored sets for causal and temporal inference
Rohin Shah
1y
0
35
The Indexing Problem
johnswentworth
2y
2
10
What is the subjective experience of free will for agents?
Gordon Seidoh Worley
2y
19
70
Builder/Breaker for Deconfusion
abramdemski
2mo
9
115
Principles for Alignment/Agency Projects
johnswentworth
5mo
20
52
All the posts I will never write
Alexander Gietelink Oldenziel
4mo
8
76
Learning Normativity: A Research Agenda
abramdemski
2y
18
32
Knowledge, manipulation, and free will
Stuart_Armstrong
2y
15
65
Distributed Decisions
johnswentworth
6mo
4
27
Non-poisonous cake: anthropic updates are normal
Stuart_Armstrong
1y
11
41
Do Sufficiently Advanced Agents Use Logic?
abramdemski
3y
11
28
Values Form a Shifting Landscape (and why you might care)
VojtaKovarik
2y
6
14
Learning Normativity: Language
Bunthut
1y
4
27
Learning human preferences: optimistic and pessimistic scenarios
Stuart_Armstrong
2y
6
29
Hiding Complexity
Rafael Harth
2y
14
17
Egan's Theorem?
johnswentworth
2y
12
14
What is a VNM stable set, really?
Nisan
1y
0