Go Back
Choose this branch
Choose this branch
meritocratic
regular
democratic
hot
top
alive
27 posts
World Optimization
AI Safety Camp
Ethics & Morality
Surveys
Covid-19
16 posts
Practical
Symbol Grounding
Security Mindset
Software Tools
Careers
Updated Beliefs (examples of)
Organizational Culture & Design
QURI
106
Don't leave your fingerprints on the future
So8res
2mo
32
80
Nearcast-based "deployment problem" analysis
HoldenKarnofsky
3mo
2
107
Moral strategies at different capability levels
Richard_Ngo
4mo
14
85
Reshaping the AI Industry
Thane Ruthenis
6mo
34
173
Morality is Scary
Wei_Dai
1y
125
20
Some ideas for epistles to the AI ethicists
Charlie Steiner
3mo
0
14
Reflection Mechanisms as an Alignment target: A follow-up survey
Marius Hobbhahn
2mo
2
76
Safety-capabilities tradeoff dials are inevitable in AGI
Steven Byrnes
1y
4
110
How do we prepare for final crunch time?
Eli Tyre
1y
30
26
Reflection Mechanisms as an Alignment target: A survey
Marius Hobbhahn
6mo
1
159
Possible takeaways from the coronavirus pandemic for slow AI takeoff
Vika
2y
36
28
A survey of tool use and workflows in alignment research
Logan Riggs
9mo
5
19
Reading the ethicists 2: Hunting for AI alignment papers
Charlie Steiner
6mo
1
63
"Existential risk from AI" survey results
Rob Bensinger
1y
8
106
Thoughts on AGI organizations and capabilities work
Rob Bensinger
13d
17
27
Deconfusing Direct vs Amortised Optimization
beren
18d
6
256
Six Dimensions of Operational Adequacy in AGI Projects
Eliezer Yudkowsky
6mo
65
34
Some advice on independent research
Marius Hobbhahn
1mo
4
92
An Update on Academia vs. Industry (one year into my faculty job)
David Scott Krueger (formerly: capybaralet)
3mo
18
82
Linkpost: Github Copilot productivity experiment
Daniel Kokotajlo
3mo
4
215
How To Get Into Independent Research On Alignment/Agency
johnswentworth
1y
33
34
New tool for exploring EA Forum, LessWrong and Alignment Forum - Tree of Tags
Filip Sondej
3mo
2
14
POWERplay: An open-source toolchain to study AI power-seeking
Edouard Harris
1mo
0
73
AI Safety Papers: An App for the TAI Safety Database
ozziegooen
1y
13
58
The Codex Skeptic FAQ
Michaƫl Trazzi
1y
24
18
Do yourself a FAVAR: security mindset
lcmgcd
6mo
2
119
List of resolved confusions about IDA
Wei_Dai
3y
18
23
Classical symbol grounding and causal graphs
Stuart_Armstrong
1y
2