Tree of Tags

42 posts: GPT, Bounties & Prizes (active), AI-assisted Alignment, AI Safety Public Materials, List of Links

8 posts: Moore's Law, Compute, Nanotechnology, Computer Science, Tripwire, Quantum Mechanics

93 [Link] Why I’m optimistic about OpenAI’s alignment approach

janleike

15d

13

26 An exploration of GPT-2's embedding weights

Adam Scherlis

7d

2

16 Research request (alignment strategy): Deep dive on "making AI solve alignment for us"

JanBrauner

19d

3

13 [LINK] - ChatGPT discussion

JanBrauner

19d

7

17 Distribution Shifts and The Importance of AI Safety

Leon Lang

2mo

2

8 AI-assisted list of ten concrete alignment things to do right now

lcmgcd

3mo

5

72 NeurIPS ML Safety Workshop 2022

Dan H

4mo

2

74 [$20K in Prizes] AI Safety Arguments Competition

Dan H

7mo

543

140 Developmental Stages of GPTs

orthonormal

2y

74

0 New(ish) AI control ideas

Stuart_Armstrong

5y

0

89 Collection of GPT-3 results

Kaj_Sotala

2y

24

158 interpreting GPT: the logit lens

nostalgebraist

2y

32

136 MIRI comments on Cotra's "Case for Aligning Narrowly Superhuman Models"

Rob Bensinger

1y

13

111 Alignment As A Bottleneck To Usefulness Of GPT-3

johnswentworth

2y

57

2 Corrigibility thoughts I: caring about multiple things

Stuart_Armstrong

5y

0

27 Algorithmic Similarity

LukasM

3y

10

91 Compute Trends Across Three eras of Machine Learning

Jsevillamol

10mo

13

24 Implications of Quantum Computing for Artificial Intelligence Alignment Research

Jsevillamol

3y

3

120 Moore's Law, AI, and the pace of progress

Veedrac

1y

39

42 Reasons compute may not drive AI capabilities growth

Kythe

4y

10

34 Verification and Transparency

DanielFilan

3y

6

59 Predicting GPU performance

Marius Hobbhahn

6d

24