Tree of Tags

Go Back

Choose this branch

Choose this branch

meritocratic regular democratic

hot top alive

128 posts GPT Bounties & Prizes (active) QURI AI Safety Public Materials Squiggle Generativity

112 posts Conjecture (org) Language Models Refine Project Announcement Anthropic Exploratory Engineering Encultured AI (org) Transformer Circuits Transformers

45 Next Level Seinfeld

Zvi

1d

6

91 Bad at Arithmetic, Promising at Math

cohenmacaulay

2d

17

13 Does ChatGPT’s performance warrant working on a tutor for children? [It’s time to take it to the lab.]

Bill Benzon

1d

2

237 Jailbreaking ChatGPT on Release Day

Zvi

18d

74

18 Is the ChatGPT-simulated Linux virtual machine real?

Kenoubi

7d

7

23 A crisis for online communication: bots and bot users will overrun the Internet?

Mitchell_Porter

9d

11

69 Announcing AI Alignment Awards: $100k research contests about goal misgeneralization & corrigibility

Akash

28d

20

14 Best introductory overviews of AGI safety?

Jakub Kraus

7d

5

31 [ASoT] Finetuning, RL, and GPT's world prior

Jozdien

18d

8

111 AI Timelines via Cumulative Optimization Power: Less Long, More Short

jacob_cannell

2mo

32

111 Announcing $5,000 bounty for (responsibly) ending malaria

lc

2mo

42

17 ChatGPT is surprisingly and uncanningly good at pretending to be sentient

ZT5

17d

11

18 ChatGPT: First Impressions

specbug

19d

2

7 High level discourse structure in ChatGPT: Part 2 [Quasi-symbolic?]

Bill Benzon

10d

0

27 Discovering Language Model Behaviors with Model-Written Evaluations

evhub

4h

3

29 Take 11: "Aligning language models" should be weirder.

Charlie Steiner

2d

0

80 [Interim research report] Taking features out of superposition with sparse autoencoders

Lee Sharkey

7d

10

45 Discovering Latent Knowledge in Language Models Without Supervision

Xodarap

6d

1

5 Will research in AI risk jinx it? Consequences of training AI on AI risk arguments

Yann Dubois

1d

6

183 Conjecture: a retrospective after 8 months of work

Connor Leahy

27d

9

123 Did ChatGPT just gaslight me?

ThomasW

19d

45

46 A brainteaser for language models

Adam Scherlis

8d

3

213 Mysteries of mode collapse

janus

1mo

35

103 What I Learned Running Refine

adamShimi

26d

5

26 An exploration of GPT-2's embedding weights

Adam Scherlis

7d

2

27 Tradeoffs in complexity, abstraction, and generality

remember

8d

0

472 Simulators

janus

3mo

103

85 Conjecture Second Hiring Round

Connor Leahy

27d

0