Tree of Tags

Go Back

Choose this branch

You can't go any further

meritocratic regular democratic

hot top alive

98 posts Existential Risk Biosecurity

39 posts Academic Papers

70 Far-UVC Light Update: No, LEDs are not around the corner (tweetstorm)

Davidmanheim

1mo

27

-19 AI can exploit safety plans posted on the Internet

Peter S. Park

16d

4

23 Intercept article about lab accidents

ChristianKl

1mo

9

3 AI Safety in a Vulnerable World: Requesting Feedback on Preliminary Thoughts

Jordan Arel

14d

2

20 4 Key Assumptions in AI Safety

Prometheus

1mo

5

3 Why do we post our AI safety plans on the Internet?

Peter S. Park

1mo

4

9 Value of Querying 100+ People About Humanity's Future

rodeo_flagellum

1mo

3

19 Double Asteroid Redirection Test succeeds

sanxiyn

2mo

5

35 New US Senate Bill on X-Risk Mitigation [Linkpost]

Evan R. Murphy

5mo

12

35 The Dumbest Possible Gets There First

Artaxerxes

4mo

7

12 The Shape of Things to Come

Alex Beyman

2mo

3

15 Case Rates to Sequencing Reads

jefftk

3mo

4

5 X-risk Mitigation Does Actually Require Longtermism

DragonGod

1mo

1

35 Cultivating Valiance

Shoshannah Tekofsky

4mo

4

12 Characterizing Intrinsic Compositionality in Transformers with Tree Projections

Ulisse Mini

1mo

2

199 Some AI research areas and their relevance to existential safety

Andrew_Critch

2y

40

164 2021 AI Alignment Literature Review and Charity Comparison

Larks

12mo

26

9 The Mind Is Not Designed For Thinking

CronoDAS

13y

7

41 New paper from MIRI: "Toward idealized decision theory"

So8res

8y

22

35 New paper: Corrigibility with Utility Preservation

Koen.Holtman

3y

11

8 [Preprint for commenting] Digital Immortality: Theory and Protocol for Indirect Mind Uploading

avturchin

4y

5

38 Hope Function

gwern

10y

8

33 Some conceptual highlights from “Disjunctive Scenarios of Catastrophic AI Risk”

Kaj_Sotala

4y

4

3 Social Choice Ethics in Artificial Intelligence (paper challenging CEV-like approaches to choosing an AI's values)

Kaj_Sotala

5y

0

54 [link] Why Self-Control Seems (but may not be) Limited

Kaj_Sotala

8y

10

7 A discussion of the paper, "Large Language Models are Zero-Shot Reasoners"

HiroSakuraba

6mo

0

9 IQ Scores Fail to Predict Academic Performance in Children With Autism

InquilineKea

12y

9

21 Study on what makes people approve or condemn mind upload technology; references LW

Kaj_Sotala

4y

0