Go Back
Choose this branch
Choose this branch
meritocratic
regular
democratic
hot
top
alive
45 posts
Instrumental Convergence
Orthogonality Thesis
12 posts
Deconfusion
Gradient Hacking
Gradient Descent
57
You can still fetch the coffee today if you're dead tomorrow
davidad
11d
15
59
Instrumental convergence is what makes general intelligence possible
tailcalled
1mo
11
33
A caveat to the Orthogonality Thesis
Wuschel Schulz
1mo
10
4
Assessing the Capabilities of ChatGPT through Success Rates
Zachary Robertson
7d
0
38
Empowerment is (almost) All We Need
jacob_cannell
1mo
43
22
Instrumental convergence in single-agent systems
Edouard Harris
2mo
4
2
The Opportunity and Risks of Learning Human Values In-Context
Zachary Robertson
10d
4
17
Misalignment-by-default in multi-agent systems
Edouard Harris
2mo
8
10
Is the Orthogonality Thesis true for humans?
Noosphere89
1mo
18
10
Instrumental convergence: scale and physical interactions
Edouard Harris
2mo
0
98
Coherence arguments imply a force for goal-directed behavior
KatjaGrace
1y
27
171
Debate on Instrumental Convergence between LeCun, Russell, Bengio, Zador, and More
Ben Pace
3y
60
3
[ASoT] Instrumental convergence is useful
Ulisse Mini
1mo
9
42
P₂B: Plan to P₂B Better
Ramana Kumar
1y
14
2
(Extremely) Naive Gradient Hacking Doesn't Work
ojorgensen
9h
0
30
Gradient hacking: definitions and examples
Richard_Ngo
5mo
1
68
Gradient descent is not just more efficient genetic algorithms
leogao
1y
14
29
Hypothesis: gradient descent prefers general circuits
Quintin Pope
10mo
26
43
Thoughts on gradient hacking
Richard_Ngo
1y
12
45
Applications for Deconfusing Goal-Directedness
adamShimi
1y
3
45
Looking Deeper at Deconfusion
adamShimi
1y
13
97
Gradient hacking
evhub
3y
39
16
Some real examples of gradient hacking
Oliver Sourbut
1y
8
21
Alex Turner's Research, Comprehensive Information Gathering
adamShimi
1y
3
15
Approaches to gradient hacking
adamShimi
1y
8
2
Why Do AI researchers Rate the Probability of Doom So Low?
Aorou
2mo
6