43 posts
World Optimization
Practical
AI Safety Camp
Ethics & Morality
Symbol Grounding
Security Mindset
Software Tools
Surveys
Careers
Updated Beliefs (examples of)
Organizational Culture & Design
Covid-19
8 posts
Existential Risk
Academic Papers
106
Thoughts on AGI organizations and capabilities work
Rob Bensinger
13d
17
27
Deconfusing Direct vs Amortised Optimization
beren
18d
6
106
Don't leave your fingerprints on the future
So8res
2mo
32
92
An Update on Academia vs. Industry (one year into my faculty job)
David Scott Krueger (formerly: capybaralet)
3mo
18
28
A survey of tool use and workflows in alignment research
Logan Riggs
9mo
5
20
Some ideas for epistles to the AI ethicists
Charlie Steiner
3mo
0
44
What technologies could cause world GDP doubling times to be <8 years?
Daniel Kokotajlo
2y
44
39
AI x-risk reduction: why I chose academia over industry
David Scott Krueger (formerly: capybaralet)
1y
14
19
Reading the ethicists 2: Hunting for AI alignment papers
Charlie Steiner
6mo
1
52
Where are intentions to be found?
Alex Flint
1y
12
119
List of resolved confusions about IDA
Wei_Dai
3y
18
18
Do yourself a FAVAR: security mindset
lcmgcd
6mo
2
73
AI Safety Papers: An App for the TAI Safety Database
ozziegooen
1y
13
29
A test for symbol grounding methods: true zero-sum games
Stuart_Armstrong
3y
2
25
The Dumbest Possible Gets There First
Artaxerxes
4mo
7
13
Concrete Advice for Forming Inside Views on AI Safety
Neel Nanda
4mo
6
39
New paper: Corrigibility with Utility Preservation
Koen.Holtman
3y
11
26
[Linkpost] Existential Risk Analysis in Empirical Research Papers
Dan H
5mo
0
31
Techniques for optimizing worst-case performance
paulfchristiano
3y
12
33
What I talk about when I talk about AI x-risk: 3 core claims I want machine learning researchers to address.
David Scott Krueger (formerly: capybaralet)
3y
13
46
A list of good heuristics that the case for AI x-risk fails
David Scott Krueger (formerly: capybaralet)
3y
14
191
Some AI research areas and their relevance to existential safety
Andrew_Critch
2y
40