Tree of Tags

Go Back

Choose this branch

Choose this branch

meritocratic regular democratic

hot top alive

128 posts GPT Bounties & Prizes (active) QURI AI Safety Public Materials Squiggle Generativity

112 posts Conjecture (org) Language Models Refine Project Announcement Anthropic Exploratory Engineering Encultured AI (org) Transformer Circuits Transformers

43 Next Level Seinfeld

Zvi

1d

6

70 Bad at Arithmetic, Promising at Math

cohenmacaulay

2d

17

11 Does ChatGPT’s performance warrant working on a tutor for children? [It’s time to take it to the lab.]

Bill Benzon

1d

2

160 Jailbreaking ChatGPT on Release Day

Zvi

18d

74

27 A crisis for online communication: bots and bot users will overrun the Internet?

Mitchell_Porter

9d

11

10 Is the ChatGPT-simulated Linux virtual machine real?

Kenoubi

7d

7

45 Announcing AI Alignment Awards: $100k research contests about goal misgeneralization & corrigibility

Akash

28d

20

96 AI Timelines via Cumulative Optimization Power: Less Long, More Short

jacob_cannell

2mo

32

19 ChatGPT: First Impressions

specbug

19d

2

13 ChatGPT is surprisingly and uncanningly good at pretending to be sentient

ZT5

17d

11

75 Announcing $5,000 bounty for (responsibly) ending malaria

lc

2mo

42

11 [LINK] - ChatGPT discussion

JanBrauner

19d

7

34 Prizes for ML Safety Benchmark Ideas

joshc

1mo

3

130 Announcing the Inverse Scaling Prize ($250k Prize Pool)

Ethan Perez

5mo

14

27 Discovering Language Model Behaviors with Model-Written Evaluations

evhub

4h

3

26 Take 11: "Aligning language models" should be weirder.

Charlie Steiner

2d

0

59 [Interim research report] Taking features out of superposition with sparse autoencoders

Lee Sharkey

7d

10

55 A brainteaser for language models

Adam Scherlis

8d

3

38 Discovering Latent Knowledge in Language Models Without Supervision

Xodarap

6d

1

98 Did ChatGPT just gaslight me?

ThomasW

19d

45

132 Conjecture: a retrospective after 8 months of work

Connor Leahy

27d

9

3 Will research in AI risk jinx it? Consequences of training AI on AI risk arguments

Yann Dubois

1d

6

103 What I Learned Running Refine

adamShimi

26d

5

164 Mysteries of mode collapse

janus

1mo

35

22 Tradeoffs in complexity, abstraction, and generality

remember

8d

0

16 An exploration of GPT-2's embedding weights

Adam Scherlis

7d

2

23 Does a LLM have a utility function?

Dagon

11d

6

42 The First Filter

adamShimi

24d

5