Tree of Tags

Go Back

Choose this branch

Choose this branch

meritocratic regular democratic

hot top alive

42 posts GPT Bounties & Prizes (active) AI-assisted Alignment AI Safety Public Materials List of Links

8 posts Moore's Law Compute Nanotechnology Computer Science Tripwire Quantum Mechanics

96 [Link] Why I’m optimistic about OpenAI’s alignment approach

janleike

15d

13

34 An exploration of GPT-2's embedding weights

Adam Scherlis

7d

2

20 Research request (alignment strategy): Deep dive on "making AI solve alignment for us"

JanBrauner

19d

3

14 [LINK] - ChatGPT discussion

JanBrauner

19d

7

26 Distribution Shifts and The Importance of AI Safety

Leon Lang

2mo

2

11 AI-assisted list of ten concrete alignment things to do right now

lcmgcd

3mo

5

76 NeurIPS ML Safety Workshop 2022

Dan H

4mo

2

126 [$20K in Prizes] AI Safety Arguments Competition

Dan H

7mo

543

155 Developmental Stages of GPTs

orthonormal

2y

74

0 New(ish) AI control ideas

Stuart_Armstrong

5y

0

82 Collection of GPT-3 results

Kaj_Sotala

2y

24

171 interpreting GPT: the logit lens

nostalgebraist

2y

32

108 MIRI comments on Cotra's "Case for Aligning Narrowly Superhuman Models"

Rob Bensinger

1y

13

117 Alignment As A Bottleneck To Usefulness Of GPT-3

johnswentworth

2y

57

1 Corrigibility thoughts I: caring about multiple things

Stuart_Armstrong

5y

0

34 Algorithmic Similarity

LukasM

3y

10

115 Compute Trends Across Three eras of Machine Learning

Jsevillamol

10mo

13

29 Implications of Quantum Computing for Artificial Intelligence Alignment Research

Jsevillamol

3y

3

120 Moore's Law, AI, and the pace of progress

Veedrac

1y

39

43 Reasons compute may not drive AI capabilities growth

Kythe

4y

10

24 Verification and Transparency

DanielFilan

3y

6

70 Predicting GPU performance

Marius Hobbhahn

6d

24