Tree of Tags

16 posts Myopia

12 posts Self Fulfilling/Refuting Prophecies Luck

37 Steering Behaviour: Testing for (Non-)Myopia in Language Models

Evan R. Murphy

15d

16

11 Generative, Episodic Objectives for Safe AI

Michael Glass

2mo

3

50 LCDT, A Myopic Decision Theory

adamShimi

1y

51

58 Partial Agency

abramdemski

3y

18

91 The Credit Assignment Problem

abramdemski

3y

40

55 Why GPT wants to mesa-optimize & how we might change this

John_Maxwell

2y

32

44 Towards a mechanistic understanding of corrigibility

evhub

3y

26

43 Acceptability Verification: A Research Agenda

David Udell

5mo

0

56 Arguments against myopic training

Richard_Ngo

2y

39

57 Open Problems with Myopia

Mark Xu

1y

16

28 Evan Hubinger on Homogeneity in Takeoff Speeds, Learned Optimization and Interpretability

Michaël Trazzi

1y

0

5 Limiting an AGI's Context Temporally

EulersApprentice

3y

11

16 The Dualist Predict-O-Matic ($100 prize)

John_Maxwell

3y

35

55 AI safety via market making

evhub

2y

45

31 Random Thoughts on Predict-O-Matic

abramdemski

3y

3

33 An example of self-fulfilling spurious proofs in UDT

cousin_it

10y

43

10 Luck II: Expecting White Swans

fowlertm

9y

87

29 Self-Supervised Learning and AGI Safety

Steven Byrnes

3y

9

16 Encouragement to Instill Confidence?

Zvi

10mo

5

-3 Can chess be a game of luck?

Rune

13y

44

144 Self-fulfilling correlations

PhilGoetz

12y

50

12 Omega and self-fulfilling prophecies

RichardKennaway

11y

19

1 Self-fulfilling values of time

KatjaGrace

5y

0

25 Do the 'unlucky' systematically underestimate high-variance strategies?

MBlume

13y

5

38 Fifty Shades of Self-Fulfilling Prophecy

PhilGoetz

8y

87

14 Self-Fulfilling Prophecies Aren't Always About Self-Awareness

John_Maxwell

3y

7