105 posts
Optimization
General Intelligence
AI Services (CAIS)
Adaptation Executors
Superstimuli
Narrow AI
Selection vs Control
Hope
Delegation
43 posts
Goodhart's Law
57
Humans aren't fitness maximizers
So8res
2mo
45
20
Take 6: CAIS is actually Orwellian.
Charlie Steiner
13d
5
13
The reward function is already how well you manipulate humans
Kerry
2mo
9
89
What's General-Purpose Search, And Why Might We Expect To See It In Trained ML Systems?
johnswentworth
4mo
15
51
Measuring Optimization Power
Eliezer Yudkowsky
14y
35
79
"Normal" is the equilibrium state of past optimization processes
Alex_Altair
1mo
5
61
Vingean Agency
abramdemski
3mo
13
19
Are Intelligence and Generality Orthogonal?
cubefox
5mo
16
19
Is General Intelligence "Compact"?
DragonGod
5mo
6
159
Utility Maximization = Description Length Minimization
johnswentworth
1y
40
14
I No Longer Believe Intelligence to be "Magical"
DragonGod
6mo
34
19
[Yann Lecun] A Path Towards Autonomous Machine Intelligence
DragonGod
5mo
12
60
Ngo and Yudkowsky on scientific reasoning and pivotal acts
Eliezer Yudkowsky
10mo
13
16
program searches
carado
3mo
2
41
Don't align agents to evaluations of plans
TurnTrout
24d
46
48
Don't design agents which exploit adversarial inputs
TurnTrout
1mo
61
55
Alignment allows "nonrobust" decision-influences and doesn't require robust grading
TurnTrout
21d
27
40
Leto among the Machines
Virgil Kurkjian
4y
20
50
Signaling isn't about signaling, it's about Goodhart
Valentine
11mo
31
84
Moral Mazes and Short Termism
Zvi
3y
21
18
Reducing Goodhart: Announcement, Executive Summary
Charlie Steiner
4mo
0
37
Re-introducing Selection vs Control for Optimization (Optimizing and Goodhart Effects - Clarifying Thoughts, Part 1)
Davidmanheim
3y
5
5
The Three Levels of Goodhart's Curse
Scott Garrabrant
4y
0
31
Competent Preferences
Charlie Steiner
1y
2
21
(Some?) Possible Multi-Agent Goodhart Interactions
Davidmanheim
4y
2
50
Specification gaming examples in AI
Vika
4y
9
32
What does Optimization Mean, Again? (Optimizing and Goodhart Effects - Clarifying Thoughts, Part 2)
Davidmanheim
3y
7
30
Religion as Goodhart
shminux
3y
6