Tree of Tags

Go Back

Choose this branch

Choose this branch

meritocratic regular democratic

hot top alive

32 posts Threat Models Coordination / Cooperation Sharp Left Turn Fiction AI Risk Concrete Stories Site Meta AMA Multipolar Scenarios Prisoner's Dilemma Moloch Paperclip Maximizer Q&A (format)

51 posts World Optimization Existential Risk Practical Academic Papers Ethics & Morality AI Safety Camp Symbol Grounding Security Mindset Software Tools Careers Surveys Updated Beliefs (examples of)

189 The next decades might be wild

Marius Hobbhahn

5d

21

44 AI X-risk >35% mostly based on a recent peer-reviewed argument

michaelcohen

1mo

31

67 We may be able to see sharp left turns coming

Ethan Perez

3mo

26

30 Refining the Sharp Left Turn threat model, part 2: applying alignment techniques

Vika

25d

4

292 A central AI alignment problem: capabilities generalization, and the sharp left turn

So8res

6mo

48

136 AI coordination needs clear wins

evhub

3mo

15

83 Refining the Sharp Left Turn threat model, part 1: claims and mechanisms

Vika

4mo

3

103 Welcome & FAQ!

Ruby

1y

8

120 Late 2021 MIRI Conversations: AMA / Discussion

Rob Bensinger

9mo

208

58 My Overview of the AI Alignment Landscape: Threat Models

Neel Nanda

12mo

4

82 What Failure Looks Like: Distilling the Discussion

Ben Pace

2y

14

240 Another (outer) alignment failure story

paulfchristiano

1y

38

53 Rogue AGI Embodies Valuable Intellectual Property

Mark Xu

1y

9

81 Introducing the AI Alignment Forum (FAQ)

habryka

4y

8

82 Thoughts on AGI organizations and capabilities work

Rob Bensinger

13d

17

69 Deconfusing Direct vs Amortised Optimization

beren

18d

6

80 Don't leave your fingerprints on the future

So8res

2mo

32

144 An Update on Academia vs. Industry (one year into my faculty job)

David Scott Krueger (formerly: capybaralet)

3mo

18

45 The Dumbest Possible Gets There First

Artaxerxes

4mo

7

23 Concrete Advice for Forming Inside Views on AI Safety

Neel Nanda

4mo

6

58 A survey of tool use and workflows in alignment research

Logan Riggs

9mo

5

18 Some ideas for epistles to the AI ethicists

Charlie Steiner

3mo

0

42 What technologies could cause world GDP doubling times to be <8 years?

Daniel Kokotajlo

2y

44

31 New paper: Corrigibility with Utility Preservation

Koen.Holtman

3y

11

73 AI x-risk reduction: why I chose academia over industry

David Scott Krueger (formerly: capybaralet)

1y

14

54 [Linkpost] Existential Risk Analysis in Empirical Research Papers

Dan H

5mo

0

23 Reading the ethicists 2: Hunting for AI alignment papers

Charlie Steiner

6mo

1

15 Techniques for optimizing worst-case performance

paulfchristiano

3y

12