Go Back
Choose this branch
Choose this branch
meritocratic
regular
democratic
hot
top
alive
1446 posts
AI
Interpretability (ML & AI)
AI Timelines
GPT
Research Agendas
Value Learning
AI Takeoff
Conjecture (org)
Embedded Agency
Machine Learning (ML)
Eliciting Latent Knowledge (ELK)
Community
118 posts
Inner Alignment
Optimization
Solomonoff Induction
Predictive Processing
Selection vs Control
Neocortex
Mesa-Optimization
Neuroscience
Priors
AI Services (CAIS)
Occam's Razor
General Intelligence
472
Simulators
janus
3mo
103
369
What 2026 looks like
Daniel Kokotajlo
1y
98
364
chinchilla's wild implications
nostalgebraist
4mo
114
364
DeepMind alignment team opinions on AGI ruin arguments
Vika
4mo
34
344
(My understanding of) What Everyone in Technical Alignment is Doing and Why
Thomas Larsen
3mo
83
338
A Mechanistic Interpretability Analysis of Grokking
Neel Nanda
4mo
39
325
Discussion with Eliezer Yudkowsky on AGI interventions
Rob Bensinger
1y
257
291
The Parable of Predict-O-Matic
abramdemski
3y
42
287
Two-year update on my personal AI timelines
Ajeya Cotra
4mo
60
273
EfficientZero: How It Works
1a3orn
1y
42
265
A challenge for AGI organizations, and a challenge for readers
Rob Bensinger
19d
30
258
On how various plans miss the hard bits of the alignment challenge
So8res
5mo
81
255
Are we in an AI overhang?
Andy Jones
2y
109
252
Reward is not the optimization target
TurnTrout
4mo
97
217
The ground of optimization
Alex Flint
2y
74
175
Inner Alignment: Explain like I'm 12 Edition
Rafael Harth
2y
46
166
Risks from Learned Optimization: Introduction
evhub
3y
42
148
The Solomonoff Prior is Malign
Mark Xu
2y
52
147
Matt Botvinick on the spontaneous emergence of learning algorithms
Adam Scholl
2y
87
144
My computational framework for the brain
Steven Byrnes
2y
26
139
Selection vs Control
abramdemski
3y
25
136
Inner Alignment in Salt-Starved Rats
Steven Byrnes
2y
39
127
A Semitechnical Introductory Dialogue on Solomonoff Induction
Eliezer Yudkowsky
1y
34
118
Reframing Superintelligence: Comprehensive AI Services as General Intelligence
Rohin Shah
3y
75
110
Book review: "A Thousand Brains" by Jeff Hawkins
Steven Byrnes
1y
18
103
What's General-Purpose Search, And Why Might We Expect To See It In Trained ML Systems?
johnswentworth
4mo
15
103
Externalized reasoning oversight: a research direction for language model alignment
tamera
4mo
22
103
Demons in Imperfect Search
johnswentworth
2y
21