Tree of Tags

Go Back

Choose this branch

Choose this branch

meritocratic regular democratic

hot top alive

43 posts World Optimization Practical AI Safety Camp Ethics & Morality Symbol Grounding Security Mindset Software Tools Surveys Careers Updated Beliefs (examples of) Organizational Culture & Design Covid-19

8 posts Existential Risk Academic Papers

106 Thoughts on AGI organizations and capabilities work

Rob Bensinger

13d

17

27 Deconfusing Direct vs Amortised Optimization

beren

18d

6

106 Don't leave your fingerprints on the future

So8res

2mo

32

92 An Update on Academia vs. Industry (one year into my faculty job)

David Scott Krueger (formerly: capybaralet)

3mo

18

28 A survey of tool use and workflows in alignment research

Logan Riggs

9mo

5

20 Some ideas for epistles to the AI ethicists

Charlie Steiner

3mo

0

44 What technologies could cause world GDP doubling times to be <8 years?

Daniel Kokotajlo

2y

44

39 AI x-risk reduction: why I chose academia over industry

David Scott Krueger (formerly: capybaralet)

1y

14

19 Reading the ethicists 2: Hunting for AI alignment papers

Charlie Steiner

6mo

1

52 Where are intentions to be found?

Alex Flint

1y

12

119 List of resolved confusions about IDA

Wei_Dai

3y

18

18 Do yourself a FAVAR: security mindset

lcmgcd

6mo

2

73 AI Safety Papers: An App for the TAI Safety Database

ozziegooen

1y

13

29 A test for symbol grounding methods: true zero-sum games

Stuart_Armstrong

3y

2

25 The Dumbest Possible Gets There First

Artaxerxes

4mo

7

13 Concrete Advice for Forming Inside Views on AI Safety

Neel Nanda

4mo

6

39 New paper: Corrigibility with Utility Preservation

Koen.Holtman

3y

11

26 [Linkpost] Existential Risk Analysis in Empirical Research Papers

Dan H

5mo

0

31 Techniques for optimizing worst-case performance

paulfchristiano

3y

12

33 What I talk about when I talk about AI x-risk: 3 core claims I want machine learning researchers to address.

David Scott Krueger (formerly: capybaralet)

3y

13

46 A list of good heuristics that the case for AI x-risk fails

David Scott Krueger (formerly: capybaralet)

3y

14

191 Some AI research areas and their relevance to existential safety

Andrew_Critch

2y

40