AI interpretability · Redwood Research · Anthropic · Alignment Research Center (11 posts)

Karma | Title | Author | Posted | Comments
26 | Why mechanistic interpretability does not and cannot contribute to long-term AGI safety (from messages with a friend) | Remmelt | 1d | 2
19 | The limited upside of interpretability | Peter S. Park | 1mo | 3
41 | A Barebones Guide to Mechanistic Interpretability Prerequisites | Neel Nanda | 21d | 1
113 | Apply to the second ML for Alignment Bootcamp (MLAB 2) in Berkeley [Aug 15 - Fri Sept 2] | Buck | 7mo | 7
72 | ARC is hiring alignment theory researchers | Paul_Christiano | 1y | 3
14 | Chris Olah on working at top AI labs without an undergrad degree | 80000_Hours | 1y | 0
70 | Redwood Research is hiring for several roles | Jack R | 1y | 0
5 | Is it possible that SBF-linked funds haven't yet been transferred to Anthropic or that Anthropic would have to return these funds? | donegal | 1mo | 0
3 | Chris Olah on what the hell is going on inside neural networks | 80000_Hours | 1y | 0
57 | Join the interpretability research hackathon | Esben Kran | 1mo | 0
97 | We're Redwood Research, we do applied alignment research, AMA | Buck | 1y | 49

Ought (10 posts)
Karma | Title | Author | Posted | Comments
36 | AMA: Ought | stuhlmueller | 4mo | 52
16 | Binary prediction database and tournament | amandango | 2y | 0
5 | Andreas Stuhlmüller: Training ML systems to answer open-ended questions | EA Global | 2y | 1
45 | Ought: why it matters and ways to help | Paul_Christiano | 3y | 5
6 | [Link] "Machine Learning Projects for IDA" (Ought) | Milan_Griffes | 3y | 0
18 | Automating reasoning about the future at Ought | jungofthewon | 2y | 0
10 | Estimation and forecasting — an overview (Amanda Ngo) | EA Global | 2y | 0
4 | [Link] "Evaluating Arguments One Step at a Time" (Ought) | Milan_Griffes | 2y | 0
37 | Ought's theory of change | stuhlmueller | 8mo | 4
17 | [Link] "Progress Update October 2019" (Ought) | Milan_Griffes | 3y | 1