344 posts
Research Agendas
Value Learning
Reinforcement Learning
Embedded Agency
Suffering
AI Capabilities
Agency
Animal Welfare
Inverse Reinforcement Learning
Risks of Astronomical Suffering (S-risks)
Wireheading
Robust Agents
14230 posts
Decision Theory
Utility Functions
Counterfactuals
Goal-Directedness
Nutrition
Newcomb's Problem
VNM Theorem
Updateless Decision Theory
Timeless Decision Theory
Literature Reviews
Functional Decision Theory
Counterfactual Mugging
13
Note on algorithms with multiple trained components
Steven Byrnes
6h
1
34
My AGI safety research—2022 review, ’23 plans
Steven Byrnes
6d
6
71
When AI solves a game, focus on the game's mechanics, not its theme.
Cleo Nardo
27d
7
17
Riffing on the agent type
Quinn
12d
0
218
Reward is not the optimization target
TurnTrout
4mo
97
35
A Short Dialogue on the Meaning of Reward Functions
Leon Lang
1mo
0
39
Will we run out of ML data? Evidence from projecting dataset size trends
Pablo Villalobos
1mo
12
216
On how various plans miss the hard bits of the alignment challenge
So8res
5mo
81
146
Some conceptual alignment research projects
Richard_Ngo
3mo
14
249
Humans are very reliable agents
alyssavance
6mo
35
281
Is AI Progress Impossible To Predict?
alyssavance
7mo
38
10
Can GPT-3 Write Contra Dances?
jefftk
16d
0
16
generalized wireheading
carado
1mo
7
16
LLMs may capture key components of human agency
catubc
1mo
0
46
K-complexity is silly; use cross-entropy instead
So8res
1h
4
58
Take 7: You should talk about "the human's utility function" less.
Charlie Steiner
12d
22
25
How can one literally buy time (from x-risk) with money?
Alex_Altair
7d
3
31
"Attention Passengers": not for Signs
jefftk
13d
10
16
Using Obsidian if you're used to using Roam
Solenoid_Entity
9d
4
138
Decision theory does not imply that we get to have nice things
So8res
2mo
53
14
Ponzi schemes can be highly profitable if your timing is good
GeneSmith
8d
18
66
Humans do acausal coordination all the time
Adam Jermyn
1mo
36
62
Notes on "Can you control the past"
So8res
2mo
40
6
Join the AI Testing Hackathon this Friday
Esben Kran
8d
0
28
Two New Newcomb Variants
eva_
1mo
22
24
SBF x LoL
NicholasKross
1mo
6
16
Less Successful Cider Adventures
jefftk
25d
1
8
EA & LW Forums Weekly Summary (28th Nov - 4th Dec 22')
Zoe Williams
14d
1