Tree of Tags

Go Back

Choose this branch

Choose this branch

meritocratic regular democratic

hot top alive

43 posts World Optimization Practical AI Safety Camp Ethics & Morality Symbol Grounding Security Mindset Software Tools Surveys Careers Updated Beliefs (examples of) Organizational Culture & Design Covid-19

8 posts Existential Risk Academic Papers

94 Thoughts on AGI organizations and capabilities work

Rob Bensinger

13d

17

48 Deconfusing Direct vs Amortised Optimization

beren

18d

6

93 Don't leave your fingerprints on the future

So8res

2mo

32

118 An Update on Academia vs. Industry (one year into my faculty job)

David Scott Krueger (formerly: capybaralet)

3mo

18

43 A survey of tool use and workflows in alignment research

Logan Riggs

9mo

5

19 Some ideas for epistles to the AI ethicists

Charlie Steiner

3mo

0

43 What technologies could cause world GDP doubling times to be <8 years?

Daniel Kokotajlo

2y

44

56 AI x-risk reduction: why I chose academia over industry

David Scott Krueger (formerly: capybaralet)

1y

14

21 Reading the ethicists 2: Hunting for AI alignment papers

Charlie Steiner

6mo

1

44 Where are intentions to be found?

Alex Flint

1y

12

94 List of resolved confusions about IDA

Wei_Dai

3y

18

19 Do yourself a FAVAR: security mindset

lcmgcd

6mo

2

74 AI Safety Papers: An App for the TAI Safety Database

ozziegooen

1y

13

22 A test for symbol grounding methods: true zero-sum games

Stuart_Armstrong

3y

2

35 The Dumbest Possible Gets There First

Artaxerxes

4mo

7

18 Concrete Advice for Forming Inside Views on AI Safety

Neel Nanda

4mo

6

35 New paper: Corrigibility with Utility Preservation

Koen.Holtman

3y

11

40 [Linkpost] Existential Risk Analysis in Empirical Research Papers

Dan H

5mo

0

23 Techniques for optimizing worst-case performance

paulfchristiano

3y

12

28 What I talk about when I talk about AI x-risk: 3 core claims I want machine learning researchers to address.

David Scott Krueger (formerly: capybaralet)

3y

13

41 A list of good heuristics that the case for AI x-risk fails

David Scott Krueger (formerly: capybaralet)

3y

14

199 Some AI research areas and their relevance to existential safety

Andrew_Critch

2y

40