Tree of Tags

Go Back

Choose this branch

Choose this branch

meritocratic regular democratic

hot top alive

27 posts World Optimization AI Safety Camp Ethics & Morality Surveys Covid-19

16 posts Practical Symbol Grounding Security Mindset Software Tools Careers Updated Beliefs (examples of) Organizational Culture & Design QURI

106 Don't leave your fingerprints on the future

So8res

2mo

32

80 Nearcast-based "deployment problem" analysis

HoldenKarnofsky

3mo

2

107 Moral strategies at different capability levels

Richard_Ngo

4mo

14

85 Reshaping the AI Industry

Thane Ruthenis

6mo

34

173 Morality is Scary

Wei_Dai

1y

125

20 Some ideas for epistles to the AI ethicists

Charlie Steiner

3mo

0

14 Reflection Mechanisms as an Alignment target: A follow-up survey

Marius Hobbhahn

2mo

2

76 Safety-capabilities tradeoff dials are inevitable in AGI

Steven Byrnes

1y

4

110 How do we prepare for final crunch time?

Eli Tyre

1y

30

26 Reflection Mechanisms as an Alignment target: A survey

Marius Hobbhahn

6mo

1

159 Possible takeaways from the coronavirus pandemic for slow AI takeoff

Vika

2y

36

28 A survey of tool use and workflows in alignment research

Logan Riggs

9mo

5

19 Reading the ethicists 2: Hunting for AI alignment papers

Charlie Steiner

6mo

1

63 "Existential risk from AI" survey results

Rob Bensinger

1y

8

106 Thoughts on AGI organizations and capabilities work

Rob Bensinger

13d

17

27 Deconfusing Direct vs Amortised Optimization

beren

18d

6

256 Six Dimensions of Operational Adequacy in AGI Projects

Eliezer Yudkowsky

6mo

65

34 Some advice on independent research

Marius Hobbhahn

1mo

4

92 An Update on Academia vs. Industry (one year into my faculty job)

David Scott Krueger (formerly: capybaralet)

3mo

18

82 Linkpost: Github Copilot productivity experiment

Daniel Kokotajlo

3mo

4

215 How To Get Into Independent Research On Alignment/Agency

johnswentworth

1y

33

34 New tool for exploring EA Forum, LessWrong and Alignment Forum - Tree of Tags

Filip Sondej

3mo

2

14 POWERplay: An open-source toolchain to study AI power-seeking

Edouard Harris

1mo

0

73 AI Safety Papers: An App for the TAI Safety Database

ozziegooen

1y

13

58 The Codex Skeptic FAQ

Michaël Trazzi

1y

24

18 Do yourself a FAVAR: security mindset

lcmgcd

6mo

2

119 List of resolved confusions about IDA

Wei_Dai

3y

18

23 Classical symbol grounding and causal graphs

Stuart_Armstrong

1y

2