Tree of Tags

Go Back

Choose this branch

Choose this branch

meritocratic regular democratic

hot top alive

57 posts Language Models Transformers

5 posts Anthropic Transformer Circuits

472 Simulators

janus

3mo

103

223 New Scaling Laws for Large Language Models

1a3orn

8mo

21

187 The case for aligning narrowly superhuman models

Ajeya Cotra

1y

74

164 Language models seem to be much better than humans at next-token prediction

Buck

4mo

56

142 Transformer Circuits

evhub

12mo

4

123 Did ChatGPT just gaslight me?

ThomasW

19d

45

112 Who models the models that model models? An exploration of GPT-3's in-context model fitting ability

Lovre

6mo

14

110 GPT-3 Catching Fish in Morse Code

Megan Kinniment

5mo

27

103 Testing PaLM prompts on GPT3

Yitz

8mo

15

96 Paper: Teaching GPT3 to express uncertainty in words

Owain_Evans

6mo

7

94 Help ARC evaluate capabilities of current language models (still need people)

Beth Barnes

5mo

6

90 RL with KL penalties is better seen as Bayesian inference

Tomek Korbak

6mo

15

88 Inverse Scaling Prize: Round 1 Winners

Ethan Perez

2mo

16

84 A one-question Turing test for GPT-3

Paul Crowley

11mo

23

81 A Summary Of Anthropic's First Paper

Sam Ringer

11mo

0

64 Toy Models of Superposition

evhub

3mo

2

16 Understanding the tensor product formulation in Transformer Circuits

Tom Lieberum

12mo

2

11 Mechanistic Interpretability for the MLP Layers (rough early thoughts)

MadHatter

12mo

2

5 Will research in AI risk jinx it? Consequences of training AI on AI risk arguments

Yann Dubois

1d

6