Community (48 posts)
Related topics: Center for Human-Compatible AI (CHAI), Moral Uncertainty, Regulation and AI Risk, Grants & Fundraising Opportunities, Future of Humanity Institute (FHI), Population Ethics, Utilitarianism, The SF Bay Area, Future of Life Institute (FLI), Disagreement, Events (Community)
Agent Foundations (11 posts)
Related topics: Machine Intelligence Research Institute (MIRI), Cognitive Reduction, Dissolving the Question
Posts tagged "Community":

Karma | Title | Author | Age | Comments
203 | 2018 AI Alignment Literature Review and Charity Comparison | Larks | 4y | 26
127 | 2019 AI Alignment Literature Review and Charity Comparison | Larks | 3y | 18
99 | Announcing the Introduction to ML Safety course | Dan H | 4mo | 6
94 | Call for research on evaluating alignment (funding + advice available) | Beth Barnes | 1y | 11
94 | Introducing the ML Safety Scholars Program | Dan H | 7mo | 2
93 | Full-time AGI Safety! | Steven Byrnes | 1y | 3
91 | AI Safety and Neighboring Communities: A Quick-Start Guide, as of Summer 2022 | Sam Bowman | 3mo | 2
87 | Announcing AI Alignment Awards: $100k research contests about goal misgeneralization & corrigibility | Akash | 28d | 20
85 | Apply to the ML for Alignment Bootcamp (MLAB) in Berkeley [Jan 3 - Jan 22] | habryka | 1y | 4
68 | ARC is hiring! | paulfchristiano | 1y | 2
66 | AGI Safety Fundamentals curriculum and application | Richard_Ngo | 1y | 0
63 | Seeking Interns/RAs for Mechanistic Interpretability Projects | Neel Nanda | 4mo | 0
58 | Jobs: Help scale up LM alignment research at NYU | Sam Bowman | 7mo | 1
58 | Apply for research internships at ARC! | paulfchristiano | 11mo | 0
Posts tagged "Agent Foundations":

Karma | Title | Author | Age | Comments
297 | Why Agent Foundations? An Overly Abstract Explanation | johnswentworth | 9mo | 54
250 | The Rocket Alignment Problem | Eliezer Yudkowsky | 4y | 42
100 | What I’ll be doing at MIRI | evhub | 3y | 6
85 | Prize and fast track to alignment research at ALTER | Vanessa Kosoy | 3mo | 4
83 | Challenges with Breaking into MIRI-Style Research | Chris_Leong | 11mo | 15
74 | Another take on agent foundations: formalizing zero-shot reasoning | zhukeepa | 4y | 20
61 | Clarifying the Agent-Like Structure Problem | johnswentworth | 2mo | 14
37 | Grokking the Intentional Stance | jbkjr | 1y | 20
24 | On motivations for MIRI's highly reliable agent design research | jessicata | 5y | 1
23 | Bridging Expected Utility Maximization and Optimization | Whispermute | 4mo | 5
15 | My current take on the Paul-MIRI disagreement on alignability of messy AI | jessicata | 5y | 0