Tree of Tags

Go Back

Choose this branch

Choose this branch

meritocratic regular democratic

hot top alive

94 posts Rationality Abstraction Finite Factored Sets Causality Open Problems Consequentialism Filtered Evidence Techniques Consciousness Intuition Free Will Adding Up to Normality

83 posts Decision Theory Goal-Directedness Utility Functions Literature Reviews Quantilization Mild Optimization Coherence Arguments Bounded Rationality Orthogonality Thesis Law-Thinking Coherent Extrapolated Volition Indexical Information

171 Finite Factored Sets in Pictures

Magdalena Wache

9d

29

32 Counterfactability

Scott Garrabrant

1mo

4

62 Builder/Breaker for Deconfusion

abramdemski

2mo

9

108 Principles for Alignment/Agency Projects

johnswentworth

5mo

20

69 All the posts I will never write

Alexander Gietelink Oldenziel

4mo

8

88 [Intro to brain-like-AGI safety] 15. Conclusion: Open problems, how to help, AMA

Steven Byrnes

7mo

11

137 What's Up With Confusingly Pervasive Consequentialism?

Raemon

11mo

88

71 Open Problems in AI X-Risk [PAIS #5]

Dan H

6mo

3

52 Distributed Decisions

johnswentworth

6mo

4

141 Finite Factored Sets

Scott Garrabrant

1y

94

114 Saving Time

Scott Garrabrant

1y

19

86 Testing The Natural Abstraction Hypothesis: Project Update

johnswentworth

1y

17

34 Exploring Finite Factored Sets with some toy examples

Thomas Kehrenberg

9mo

1

29 [ASoT] Searching for consequentialist structure

leogao

8mo

2

35 Take 7: You should talk about "the human's utility function" less.

Charlie Steiner

12d

22

46 Notes on "Can you control the past"

So8res

2mo

40

146 why assume AGIs will optimize for fixed goals?

nostalgebraist

6mo

52

60 Finding Goals in the World Model

Jeremy Gillen

4mo

8

91 wrapper-minds are the enemy

nostalgebraist

6mo

36

175 2021 AI Alignment Literature Review and Charity Comparison

Larks

12mo

26

160 Can you control the past?

Joe Carlsmith

1y

93

56 The "Measuring Stick of Utility" Problem

johnswentworth

6mo

22

32 Exploring Mild Behaviour in Embedded Agents

Megan Kinniment

5mo

3

137 2020 AI Alignment Literature Review and Charity Comparison

Larks

1y

14

20 Quantilizers and Generative Models

Adam Jermyn

5mo

5

130 An Orthodox Case Against Utility Functions

abramdemski

2y

53

74 Coherence arguments imply a force for goal-directed behavior

KatjaGrace

1y

27

206 Realism about rationality

Richard_Ngo

4y

145