43 posts
World Optimization
Practical
AI Safety Camp
Ethics & Morality
Symbol Grounding
Security Mindset
Software Tools
Surveys
Careers
Updated Beliefs (examples of)
Organizational Culture & Design
Covid-19
8 posts
Existential Risk
Academic Papers
94
Thoughts on AGI organizations and capabilities work
Rob Bensinger
13d
17
48
Deconfusing Direct vs Amortised Optimization
beren
18d
6
93
Don't leave your fingerprints on the future
So8res
2mo
32
118
An Update on Academia vs. Industry (one year into my faculty job)
David Scott Krueger (formerly: capybaralet)
3mo
18
43
A survey of tool use and workflows in alignment research
Logan Riggs
9mo
5
19
Some ideas for epistles to the AI ethicists
Charlie Steiner
3mo
0
43
What technologies could cause world GDP doubling times to be <8 years?
Daniel Kokotajlo
2y
44
56
AI x-risk reduction: why I chose academia over industry
David Scott Krueger (formerly: capybaralet)
1y
14
21
Reading the ethicists 2: Hunting for AI alignment papers
Charlie Steiner
6mo
1
44
Where are intentions to be found?
Alex Flint
1y
12
94
List of resolved confusions about IDA
Wei_Dai
3y
18
19
Do yourself a FAVAR: security mindset
lcmgcd
6mo
2
74
AI Safety Papers: An App for the TAI Safety Database
ozziegooen
1y
13
22
A test for symbol grounding methods: true zero-sum games
Stuart_Armstrong
3y
2
35
The Dumbest Possible Gets There First
Artaxerxes
4mo
7
18
Concrete Advice for Forming Inside Views on AI Safety
Neel Nanda
4mo
6
35
New paper: Corrigibility with Utility Preservation
Koen.Holtman
3y
11
40
[Linkpost] Existential Risk Analysis in Empirical Research Papers
Dan H
5mo
0
23
Techniques for optimizing worst-case performance
paulfchristiano
3y
12
28
What I talk about when I talk about AI x-risk: 3 core claims I want machine learning researchers to address.
David Scott Krueger (formerly: capybaralet)
3y
13
41
A list of good heuristics that the case for AI x-risk fails
David Scott Krueger (formerly: capybaralet)
3y
14
199
Some AI research areas and their relevance to existential safety
Andrew_Critch
2y
40