Go Back
You can't go any further
Choose this branch
meritocratic
regular
democratic
hot
top
alive
16 posts
Myopia
12 posts
Self Fulfilling/Refuting Prophecies
Luck
37
Steering Behaviour: Testing for (Non-)Myopia in Language Models
Evan R. Murphy
15d
16
11
Generative, Episodic Objectives for Safe AI
Michael Glass
2mo
3
50
LCDT, A Myopic Decision Theory
adamShimi
1y
51
58
Partial Agency
abramdemski
3y
18
91
The Credit Assignment Problem
abramdemski
3y
40
55
Why GPT wants to mesa-optimize & how we might change this
John_Maxwell
2y
32
44
Towards a mechanistic understanding of corrigibility
evhub
3y
26
43
Acceptability Verification: A Research Agenda
David Udell
5mo
0
56
Arguments against myopic training
Richard_Ngo
2y
39
57
Open Problems with Myopia
Mark Xu
1y
16
28
Evan Hubinger on Homogeneity in Takeoff Speeds, Learned Optimization and Interpretability
Michaƫl Trazzi
1y
0
5
Limiting an AGI's Context Temporally
EulersApprentice
3y
11
16
The Dualist Predict-O-Matic ($100 prize)
John_Maxwell
3y
35
55
AI safety via market making
evhub
2y
45
31
Random Thoughts on Predict-O-Matic
abramdemski
3y
3
33
An example of self-fulfilling spurious proofs in UDT
cousin_it
10y
43
10
Luck II: Expecting White Swans
fowlertm
9y
87
29
Self-Supervised Learning and AGI Safety
Steven Byrnes
3y
9
16
Encouragement to Instill Confidence?
Zvi
10mo
5
-3
Can chess be a game of luck?
Rune
13y
44
144
Self-fulfilling correlations
PhilGoetz
12y
50
12
Omega and self-fulfilling prophecies
RichardKennaway
11y
19
1
Self-fulfilling values of time
KatjaGrace
5y
0
25
Do the 'unlucky' systematically underestimate high-variance strategies?
MBlume
13y
5
38
Fifty Shades of Self-Fulfilling Prophecy
PhilGoetz
8y
87
14
Self-Fulfilling Prophecies Aren't Always About Self-Awareness
John_Maxwell
3y
7