Tree of Tags

Go Back

Choose this branch

Choose this branch

meritocratic regular democratic

hot top alive

42 posts GPT Bounties & Prizes (active) AI-assisted Alignment AI Safety Public Materials List of Links

8 posts Moore's Law Compute Nanotechnology Computer Science Tripwire Quantum Mechanics

90 [Link] Why I’m optimistic about OpenAI’s alignment approach

janleike

15d

13

18 An exploration of GPT-2's embedding weights

Adam Scherlis

7d

2

12 Research request (alignment strategy): Deep dive on "making AI solve alignment for us"

JanBrauner

19d

3

12 [LINK] - ChatGPT discussion

JanBrauner

19d

7

8 Distribution Shifts and The Importance of AI Safety

Leon Lang

2mo

2

5 AI-assisted list of ten concrete alignment things to do right now

lcmgcd

3mo

5

68 NeurIPS ML Safety Workshop 2022

Dan H

4mo

2

22 [$20K in Prizes] AI Safety Arguments Competition

Dan H

7mo

543

125 Developmental Stages of GPTs

orthonormal

2y

74

0 New(ish) AI control ideas

Stuart_Armstrong

5y

0

96 Collection of GPT-3 results

Kaj_Sotala

2y

24

145 interpreting GPT: the logit lens

nostalgebraist

2y

32

164 MIRI comments on Cotra's "Case for Aligning Narrowly Superhuman Models"

Rob Bensinger

1y

13

105 Alignment As A Bottleneck To Usefulness Of GPT-3

johnswentworth

2y

57

3 Corrigibility thoughts I: caring about multiple things

Stuart_Armstrong

5y

0

20 Algorithmic Similarity

LukasM

3y

10

67 Compute Trends Across Three eras of Machine Learning

Jsevillamol

10mo

13

19 Implications of Quantum Computing for Artificial Intelligence Alignment Research

Jsevillamol

3y

3

120 Moore's Law, AI, and the pace of progress

Veedrac

1y

39

41 Reasons compute may not drive AI capabilities growth

Kythe

4y

10

44 Verification and Transparency

DanielFilan

3y

6

48 Predicting GPU performance

Marius Hobbhahn

6d

24