Tree of Tags

166 posts AI Risk Missing Moods Seed AI

29 posts Threat Models Sharp Left Turn Multipolar Scenarios Technological Unemployment

77 AI Safety Seems Hard to Measure

HoldenKarnofsky

12d

5

455 Counterarguments to the basic AI x-risk case

KatjaGrace

2mo

122

64 Who are some prominent reasonable people who are confident that AI won't kill everyone?

Optimization Process

15d

40

1039 Where I agree and disagree with Eliezer

paulfchristiano

6mo

205

113 AI will change the world, but won’t take it over by playing “3-dimensional chess”.

boazbarak

28d

86

1043 AGI Ruin: A List of Lethalities

Eliezer Yudkowsky

6mo

653

103 Meta AI announces Cicero: Human-Level Diplomacy play (with dialogue)

Jacy Reese Anthis

28d

64

117 Am I secretly excited for AI getting weird?

porby

1mo

4

28 Apply to attend winter AI alignment workshops (Dec 28-30 & Jan 3-5) near Berkeley

Akash

19d

1

21 Aligned Behavior is not Evidence of Alignment Past a Certain Level of Intelligence

Ronny Fernandez

15d

5

87 Niceness is unnatural

So8res

2mo

18

148 Worlds Where Iterative Design Fails

johnswentworth

3mo

26

15 Race to the Top: Benchmarks for AI Safety

Isabella Duan

16d

2

170 AGI ruin scenarios are likely (and disjunctive)

So8res

4mo

37

42 AI Neorealism: a threat model & success criterion for existential safety

davidad

5d

0

148 Clarifying AI X-risk

zac_kenton

1mo

23

14 How is the "sharp left turn" defined?

Chris_Leong

12d

3

68 Threat Model Literature Review

zac_kenton

1mo

4

31 Refining the Sharp Left Turn threat model, part 2: applying alignment techniques

Vika

25d

4

309 A central AI alignment problem: capabilities generalization, and the sharp left turn

So8res

6mo

48

47 AI X-risk >35% mostly based on a recent peer-reviewed argument

michaelcohen

1mo

31

165 AI Could Defeat All Of Us Combined

HoldenKarnofsky

6mo

29

87 Refining the Sharp Left Turn threat model, part 1: claims and mechanisms

Vika

4mo

3

70 We may be able to see sharp left turns coming

Ethan Perez

3mo

26

33 It matters when the first sharp left turn happens

Adam Jermyn

2mo

9

266 What Multipolar Failure Looks Like, and Robust Agent-Agnostic Processes (RAAPs)

Andrew_Critch

1y

60

253 Another (outer) alignment failure story

paulfchristiano

1y

38

437 What failure looks like

paulfchristiano

3y

49