Tree of Tags

Go Back

Choose this branch

Choose this branch

meritocratic regular democratic

hot top alive

32 posts Threat Models Coordination / Cooperation Sharp Left Turn Fiction AI Risk Concrete Stories Site Meta AMA Multipolar Scenarios Prisoner's Dilemma Moloch Paperclip Maximizer Q&A (format)

51 posts World Optimization Existential Risk Practical Academic Papers Ethics & Morality AI Safety Camp Symbol Grounding Security Mindset Software Tools Careers Surveys Updated Beliefs (examples of)

386 It Looks Like You're Trying To Take Over The World

gwern

9mo

125

319 What failure looks like

paulfchristiano

3y

49

253 A central AI alignment problem: capabilities generalization, and the sharp left turn

So8res

6mo

48

210 Another (outer) alignment failure story

paulfchristiano

1y

38

203 What Multipolar Failure Looks Like, and Robust Agent-Agnostic Processes (RAAPs)

Andrew_Critch

1y

60

155 The next decades might be wild

Marius Hobbhahn

5d

21

134 AI coordination needs clear wins

evhub

3mo

15

119 Late 2021 MIRI Conversations: AMA / Discussion

Rob Bensinger

9mo

208

116 Prisoners' Dilemma with Costs to Modeling

Scott Garrabrant

4y

20

110 Less Realistic Tales of Doom

Mark Xu

1y

13

109 Welcome & FAQ!

Ruby

1y

8

95 Clarifying “What failure looks like”

Sam Clarke

2y

14

86 Introducing the AI Alignment Forum (FAQ)

habryka

4y

8

79 What Failure Looks Like: Distilling the Discussion

Ben Pace

2y

14

314 How To Get Into Independent Research On Alignment/Agency

johnswentworth

1y

33

270 Six Dimensions of Operational Adequacy in AGI Projects

Eliezer Yudkowsky

6mo

65

199 Some AI research areas and their relevance to existential safety

Andrew_Critch

2y

40

175 Morality is Scary

Wei_Dai

1y

125

143 Reshaping the AI Industry

Thane Ruthenis

6mo

34

135 Possible takeaways from the coronavirus pandemic for slow AI takeoff

Vika

2y

36

118 An Update on Academia vs. Industry (one year into my faculty job)

David Scott Krueger (formerly: capybaralet)

3mo

18

116 How do we prepare for final crunch time?

Eli Tyre

1y

30

95 Moral strategies at different capability levels

Richard_Ngo

4mo

14

94 List of resolved confusions about IDA

Wei_Dai

3y

18

94 Thoughts on AGI organizations and capabilities work

Rob Bensinger

13d

17

93 Don't leave your fingerprints on the future

So8res

2mo

32

88 Linkpost: Github Copilot productivity experiment

Daniel Kokotajlo

3mo

4

78 Nearcast-based "deployment problem" analysis

HoldenKarnofsky

3mo

2