Tree of Tags

Go Back

Choose this branch

Choose this branch

meritocratic regular democratic

hot top alive

94 posts Rationality Abstraction Finite Factored Sets Causality Open Problems Consequentialism Filtered Evidence Techniques Consciousness Intuition Free Will Adding Up to Normality

83 posts Decision Theory Goal-Directedness Utility Functions Literature Reviews Quantilization Mild Optimization Coherence Arguments Bounded Rationality Orthogonality Thesis Law-Thinking Coherent Extrapolated Volition Indexical Information

125 Finite Factored Sets in Pictures

Magdalena Wache

9d

29

78 Builder/Breaker for Deconfusion

abramdemski

2mo

9

133 Finite Factored Sets

Scott Garrabrant

1y

94

40 Counterfactability

Scott Garrabrant

1mo

4

122 Principles for Alignment/Agency Projects

johnswentworth

5mo

20

35 All the posts I will never write

Alexander Gietelink Oldenziel

4mo

8

74 [Intro to brain-like-AGI safety] 15. Conclusion: Open problems, how to help, AMA

Steven Byrnes

7mo

11

80 Testing The Natural Abstraction Hypothesis: Project Update

johnswentworth

1y

17

25 A critical agential account of free will, causation, and physics

jessicata

2y

10

38 Logical Representation of Causal Models

johnswentworth

2y

0

92 Learning Normativity: A Research Agenda

abramdemski

2y

18

22 Is my result wrong? Maths vs intuition vs evolution in learning human preferences

Stuart_Armstrong

3y

11

31 Knowledge, manipulation, and free will

Stuart_Armstrong

2y

15

78 Distributed Decisions

johnswentworth

6mo

4

93 wrapper-minds are the enemy

nostalgebraist

6mo

36

134 Can you control the past?

Joe Carlsmith

1y

93

59 Take 7: You should talk about "the human's utility function" less.

Charlie Steiner

12d

22

28 Quantilizers and Generative Models

Adam Jermyn

5mo

5

64 Notes on "Can you control the past"

So8res

2mo

40

10 Exploring Mild Behaviour in Embedded Agents

Megan Kinniment

5mo

3

78 Three ways that "Sufficiently optimized agents appear coherent" can be false

Wei_Dai

3y

3

66 Buridan's ass in coordination games

jessicata

4y

26

15 Modal Bargaining Agents

orthonormal

7y

0

32 Quantilizers maximize expected utility subject to a conservative cost constraint

jessicata

7y

0

26 Another view of quantilizers: avoiding Goodhart's Law

jessicata

6y

1

29 In memoryless Cartesian environments, every UDT policy is a CDT+SIA policy

jessicata

6y

5

2 Thoughts on Quantilizers

Stuart_Armstrong

5y

0

18 Quantilal control for finite MDPs

Vanessa Kosoy

4y

0