Tree of Tags

Go Back

Choose this branch

Choose this branch

meritocratic regular democratic

hot top alive

42 posts GPT Bounties & Prizes (active) AI-assisted Alignment AI Safety Public Materials List of Links

8 posts Moore's Law Compute Nanotechnology Computer Science Tripwire Quantum Mechanics

93 [Link] Why I’m optimistic about OpenAI’s alignment approach

janleike

15d

13

26 An exploration of GPT-2's embedding weights

Adam Scherlis

7d

2

60 By Default, GPTs Think In Plain Sight

Fabien Roger

1mo

16

31 [ASoT] Finetuning, RL, and GPT's world prior

Jozdien

18d

8

7 Alignment with argument-networks and assessment-predictions

Tor Økland Barstad

7d

3

16 Research request (alignment strategy): Deep dive on "making AI solve alignment for us"

JanBrauner

19d

3

13 [LINK] - ChatGPT discussion

JanBrauner

19d

7

92 Beliefs and Disagreements about Automating Alignment Research

Ian McKenzie

3mo

4

223 New Scaling Laws for Large Language Models

1a3orn

8mo

21

36 Prizes for ML Safety Benchmark Ideas

joshc

1mo

3

151 Godzilla Strategies

johnswentworth

6mo

65

68 $20K In Bounties for AI Safety Public Materials

Dan H

4mo

7

72 NeurIPS ML Safety Workshop 2022

Dan H

4mo

2

33 Recall and Regurgitation in GPT2

Megan Kinniment

2mo

1

59 Predicting GPU performance

Marius Hobbhahn

6d

24

120 Moore's Law, AI, and the pace of progress

Veedrac

1y

39

91 Compute Trends Across Three eras of Machine Learning

Jsevillamol

10mo

13

42 Reasons compute may not drive AI capabilities growth

Kythe

4y

10

34 Verification and Transparency

DanielFilan

3y

6

27 Algorithmic Similarity

LukasM

3y

10

24 Implications of Quantum Computing for Artificial Intelligence Alignment Research

Jsevillamol

3y

3

2 Corrigibility thoughts I: caring about multiple things

Stuart_Armstrong

5y

0