Go Back
Choose this branch
You can't go any further
meritocratic
regular
democratic
hot
top
alive
98 posts
Existential Risk
Biosecurity
39 posts
Academic Papers
68
Far-UVC Light Update: No, LEDs are not around the corner (tweetstorm)
Davidmanheim
1mo
27
-12
AI can exploit safety plans posted on the Internet
Peter S. Park
16d
4
20
Intercept article about lab accidents
ChristianKl
1mo
9
6
AI Safety in a Vulnerable World: Requesting Feedback on Preliminary Thoughts
Jordan Arel
14d
2
32
4 Key Assumptions in AI Safety
Prometheus
1mo
5
3
Why do we post our AI safety plans on the Internet?
Peter S. Park
1mo
4
14
Value of Querying 100+ People About Humanity's Future
rodeo_flagellum
1mo
3
25
Double Asteroid Redirection Test succeeds
sanxiyn
2mo
5
26
New US Senate Bill on X-Risk Mitigation [Linkpost]
Evan R. Murphy
5mo
12
47
The Dumbest Possible Gets There First
Artaxerxes
4mo
7
24
The Shape of Things to Come
Alex Beyman
2mo
3
8
Case Rates to Sequencing Reads
jefftk
3mo
4
6
X-risk Mitigation Does Actually Require Longtermism
DragonGod
1mo
1
37
Cultivating Valiance
Shoshannah Tekofsky
4mo
4
14
Characterizing Intrinsic Compositionality in Transformers with Tree Projections
Ulisse Mini
1mo
2
219
Some AI research areas and their relevance to existential safety
Andrew_Critch
2y
40
185
2021 AI Alignment Literature Review and Charity Comparison
Larks
12mo
26
7
The Mind Is Not Designed For Thinking
CronoDAS
13y
7
27
New paper from MIRI: "Toward idealized decision theory"
So8res
8y
22
33
New paper: Corrigibility with Utility Preservation
Koen.Holtman
3y
11
7
[Preprint for commenting] Digital Immortality: Theory and Protocol for Indirect Mind Uploading
avturchin
4y
5
23
Hope Function
gwern
10y
8
30
Some conceptual highlights from “Disjunctive Scenarios of Catastrophic AI Risk”
Kaj_Sotala
4y
4
3
Social Choice Ethics in Artificial Intelligence (paper challenging CEV-like approaches to choosing an AI's values)
Kaj_Sotala
5y
0
36
[link] Why Self-Control Seems (but may not be) Limited
Kaj_Sotala
8y
10
10
A discussion of the paper, "Large Language Models are Zero-Shot Reasoners"
HiroSakuraba
6mo
0
7
IQ Scores Fail to Predict Academic Performance in Children With Autism
InquilineKea
12y
9
13
Study on what makes people approve or condemn mind upload technology; references LW
Kaj_Sotala
4y
0