Go Back
Choose this branch
Choose this branch
meritocratic
regular
democratic
hot
top
alive
21 posts
Ought
AI interpretability
Redwood Research
Anthropic
Alignment Research Center
7 posts
Nonlinear Fund
Superintelligence
AI Alignment Forum
Instrumental convergence thesis
Malignant AI failure mode
21
Why mechanistic interpretability does not and cannot contribute to long-term AGI safety (from messages with a friend)
Remmelt
1d
2
22
The limited upside of interpretability
Peter S. Park
1mo
3
41
AMA: Ought
stuhlmueller
4mo
52
50
A Barebones Guide to Mechanistic Interpretability Prerequisites
Neel Nanda
21d
1
109
Apply to the second ML for Alignment Bootcamp (MLAB 2) in Berkeley [Aug 15 - Fri Sept 2]
Buck
7mo
7
88
ARC is hiring alignment theory researchers
Paul_Christiano
1y
3
15
Binary prediction database and tournament
amandango
2y
0
15
Chris Olah on working at top AI labs without an undergrad degree
80000_Hours
1y
0
75
Redwood Research is hiring for several roles
Jack R
1y
0
2
Is it possible that SBF-linked funds haven't yet been transferred to Anthropic or that Anthropic would have to return these funds?
donegal
1mo
0
7
Andreas Stuhlmüller: Training ML systems to answer open-ended questions
EA Global
2y
1
52
Ought: why it matters and ways to help
Paul_Christiano
3y
5
10
[Link] "Machine Learning Projects for IDA" (Ought)
Milan_Griffes
3y
0
20
Automating reasoning about the future at Ought
jungofthewon
2y
0
8
I there a demo of "You can't fetch the coffee if you're dead"?
Ram Rachum
1mo
3
182
Listen to more EA content with The Nonlinear Library
Kat Woods
1y
89
170
EA needs a hiring agency and Nonlinear will fund you to start one
Kat Woods
11mo
12
12
The Case for Superintelligence Safety As A Cause: A Non-Technical Summary
HunterJay
3y
9
7
[Linkpost] The Problem With The Current State of AGI Definitions
Yitz
6mo
0
59
I’ll pay you a $1,000 bounty for coming up with a good bounty (x-risk related)
Emerson Spartz
1y
48
6
How likely are malign priors over objectives? [aborted WIP]
David Johnston
1mo
0