Tree of Tags

32 posts: Threat Models, Coordination / Cooperation, Sharp Left Turn, Fiction, AI Risk Concrete Stories, Site Meta, AMA, Multipolar Scenarios, Prisoner's Dilemma, Moloch, Paperclip Maximizer, Q&A (format)

51 posts: World Optimization, Existential Risk, Practical, Academic Papers, Ethics & Morality, AI Safety Camp, Symbol Grounding, Security Mindset, Software Tools, Careers, Surveys, Updated Beliefs (examples of)

121 The next decades might be wild

Marius Hobbhahn

5d

21

28 AI X-risk >35% mostly based on a recent peer-reviewed argument

michaelcohen

1mo

31

33 We may be able to see sharp left turns coming

Ethan Perez

3mo

26

42 Refining the Sharp Left Turn threat model, part 2: applying alignment techniques

Vika

25d

4

214 A central AI alignment problem: capabilities generalization, and the sharp left turn

So8res

6mo

48

132 AI coordination needs clear wins

evhub

3mo

15

59 Refining the Sharp Left Turn threat model, part 1: claims and mechanisms

Vika

4mo

3

115 Welcome & FAQ!

Ruby

1y

8

118 Late 2021 MIRI Conversations: AMA / Discussion

Rob Bensinger

9mo

208

42 My Overview of the AI Alignment Landscape: Threat Models

Neel Nanda

12mo

4

76 What Failure Looks Like: Distilling the Discussion

Ben Pace

2y

14

180 Another (outer) alignment failure story

paulfchristiano

1y

38

87 Rogue AGI Embodies Valuable Intellectual Property

Mark Xu

1y

9

91 Introducing the AI Alignment Forum (FAQ)

habryka

4y

8

106 Thoughts on AGI organizations and capabilities work

Rob Bensinger

13d

17

27 Deconfusing Direct vs Amortised Optimization

beren

18d

6

106 Don't leave your fingerprints on the future

So8res

2mo

32

92 An Update on Academia vs. Industry (one year into my faculty job)

David Scott Krueger (formerly: capybaralet)

3mo

18

25 The Dumbest Possible Gets There First

Artaxerxes

4mo

7

13 Concrete Advice for Forming Inside Views on AI Safety

Neel Nanda

4mo

6

28 A survey of tool use and workflows in alignment research

Logan Riggs

9mo

5

20 Some ideas for epistles to the AI ethicists

Charlie Steiner

3mo

0

44 What technologies could cause world GDP doubling times to be <8 years?

Daniel Kokotajlo

2y

44

39 New paper: Corrigibility with Utility Preservation

Koen.Holtman

3y

11

39 AI x-risk reduction: why I chose academia over industry

David Scott Krueger (formerly: capybaralet)

1y

14

26 [Linkpost] Existential Risk Analysis in Empirical Research Papers

Dan H

5mo

0

19 Reading the ethicists 2: Hunting for AI alignment papers

Charlie Steiner

6mo

1

31 Techniques for optimizing worst-case performance

paulfchristiano

3y

12