45 posts
Instrumental Convergence
Orthogonality Thesis
12 posts
Deconfusion
Gradient Hacking
Gradient Descent
171
Debate on Instrumental Convergence between LeCun, Russell, Bengio, Zador, and More
Ben Pace
3y
60
132
Sorting Pebbles Into Correct Heaps
Eliezer Yudkowsky
14y
109
125
Goal retention discussion with Eliezer
MaxTegmark
8y
26
112
Seeking Power is Often Convergently Instrumental in MDPs
TurnTrout
3y
38
98
Coherence arguments imply a force for goal-directed behavior
KatjaGrace
1y
27
64
Distinguishing claims about training vs deployment
Richard_Ngo
1y
30
59
Instrumental convergence is what makes general intelligence possible
tailcalled
1mo
11
57
You can still fetch the coffee today if you're dead tomorrow
davidad
11d
15
50
Clarifying Power-Seeking and Instrumental Convergence
TurnTrout
3y
7
49
Review of 'Debate on Instrumental Convergence between LeCun, Russell, Bengio, Zador, and More'
TurnTrout
1y
1
48
The Catastrophic Convergence Conjecture
TurnTrout
2y
15
42
P₂B: Plan to P₂B Better
Ramana Kumar
1y
14
40
General purpose intelligence: arguing the Orthogonality thesis
Stuart_Armstrong
10y
156
38
Empowerment is (almost) All We Need
jacob_cannell
1mo
43
97
Gradient hacking
evhub
3y
39
68
Gradient descent is not just more efficient genetic algorithms
leogao
1y
14
45
Looking Deeper at Deconfusion
adamShimi
1y
13
45
Applications for Deconfusing Goal-Directedness
adamShimi
1y
3
43
Thoughts on gradient hacking
Richard_Ngo
1y
12
30
Gradient hacking: definitions and examples
Richard_Ngo
5mo
1
29
Hypothesis: gradient descent prefers general circuits
Quintin Pope
10mo
26
21
Alex Turner's Research, Comprehensive Information Gathering
adamShimi
1y
3
16
Some real examples of gradient hacking
Oliver Sourbut
1y
8
15
Approaches to gradient hacking
adamShimi
1y
8
2
Why Do AI researchers Rate the Probability of Doom So Low?
Aorou
2mo
6
2
(Extremely) Naive Gradient Hacking Doesn't Work
ojorgensen
9h
0