Tree of Tags

14 posts: Threat Models, Sharp Left Turn

18 posts: Coordination / Cooperation, Fiction, Site Meta, AI Risk Concrete Stories, AMA, Multipolar Scenarios, Prisoner's Dilemma, Paperclip Maximizer, Moloch, Q&A (format)

38 AI Neorealism: a threat model & success criterion for existential safety

davidad

5d

0

42 Refining the Sharp Left Turn threat model, part 2: applying alignment techniques

Vika

25d

4

214 A central AI alignment problem: capabilities generalization, and the sharp left turn

So8res

6mo

48

28 AI X-risk >35% mostly based on a recent peer-reviewed argument

michaelcohen

1mo

31

59 Refining the Sharp Left Turn threat model, part 1: claims and mechanisms

Vika

4mo

3

33 We may be able to see sharp left turns coming

Ethan Perez

3mo

26

180 Another (outer) alignment failure story

paulfchristiano

1y

38

109 Less Realistic Tales of Doom

Mark Xu

1y

13

87 Rogue AGI Embodies Valuable Intellectual Property

Mark Xu

1y

9

222 What failure looks like

paulfchristiano

3y

49

58 Vignettes Workshop (AI Impacts)

Daniel Kokotajlo

1y

3

43 Distinguishing AI takeover scenarios

Sam Clarke

1y

11

51 Survey on AI existential risk scenarios

Sam Clarke

1y

11

26 Investigating AI Takeover Scenarios

Sammy Martin

1y

1

121 The next decades might be wild

Marius Hobbhahn

5d

21

132 AI coordination needs clear wins

evhub

3mo

15

255 It Looks Like You're Trying To Take Over The World

gwern

9mo

125

118 Late 2021 MIRI Conversations: AMA / Discussion

Rob Bensinger

9mo

208

115 Welcome & FAQ!

Ruby

1y

8

154 What Multipolar Failure Looks Like, and Robust Agent-Agnostic Processes (RAAPs)

Andrew_Critch

1y

60

65 We're Redwood Research, we do applied alignment research, AMA

Nate Thomas

1y

3

64 AI takeoff story: a continuation of progress by other means

Edouard Harris

1y

13

42 My Overview of the AI Alignment Landscape: Threat Models

Neel Nanda

12mo

4

76 What Failure Looks Like: Distilling the Discussion

Ben Pace

2y

14

116 Prisoners' Dilemma with Costs to Modeling

Scott Garrabrant

4y

20

48 Clarifying “What failure looks like”

Sam Clarke

2y

14

26 (apologies for Alignment Forum server outage last night)

Ruby

1y

1

91 Introducing the AI Alignment Forum (FAQ)

habryka

4y

8