Go Back
You can't go any further
Choose this branch
meritocratic
regular
democratic
hot
top
alive
16 posts
Myopia
12 posts
Self Fulfilling/Refuting Prophecies
Luck
40
Steering Behaviour: Testing for (Non-)Myopia in Language Models
Evan R. Murphy
15d
16
7
Generative, Episodic Objectives for Safe AI
Michael Glass
2mo
3
47
LCDT, A Myopic Decision Theory
adamShimi
1y
51
60
Partial Agency
abramdemski
3y
18
103
The Credit Assignment Problem
abramdemski
3y
40
63
Why GPT wants to mesa-optimize & how we might change this
John_Maxwell
2y
32
42
Towards a mechanistic understanding of corrigibility
evhub
3y
26
38
Acceptability Verification: A Research Agenda
David Udell
5mo
0
65
Arguments against myopic training
Richard_Ngo
2y
39
54
Open Problems with Myopia
Mark Xu
1y
16
35
Evan Hubinger on Homogeneity in Takeoff Speeds, Learned Optimization and Interpretability
Michaƫl Trazzi
1y
0
2
Limiting an AGI's Context Temporally
EulersApprentice
3y
11
24
The Dualist Predict-O-Matic ($100 prize)
John_Maxwell
3y
35
54
AI safety via market making
evhub
2y
45
30
Random Thoughts on Predict-O-Matic
abramdemski
3y
3
45
An example of self-fulfilling spurious proofs in UDT
cousin_it
10y
43
14
Luck II: Expecting White Swans
fowlertm
9y
87
35
Self-Supervised Learning and AGI Safety
Steven Byrnes
3y
9
19
Encouragement to Instill Confidence?
Zvi
10mo
5
-4
Can chess be a game of luck?
Rune
13y
44
174
Self-fulfilling correlations
PhilGoetz
12y
50
16
Omega and self-fulfilling prophecies
RichardKennaway
11y
19
2
Self-fulfilling values of time
KatjaGrace
5y
0
29
Do the 'unlucky' systematically underestimate high-variance strategies?
MBlume
13y
5
55
Fifty Shades of Self-Fulfilling Prophecy
PhilGoetz
8y
87
19
Self-Fulfilling Prophecies Aren't Always About Self-Awareness
John_Maxwell
3y
7