Tree of Tags

29 posts: Language Models, Definitions, PaLM, Prompt Engineering, Robotics

10 posts: Scaling Laws

164 Language models seem to be much better than humans at next-token prediction

Buck

4mo

56

142 Transformer Circuits

evhub

12mo

4

118 The case for becoming a black-box investigator of language models

Buck

7mo

19

112 Who models the models that model models? An exploration of GPT-3's in-context model fitting ability

Lovre

6mo

14

103 Testing PaLM prompts on GPT3

Yitz

8mo

15

94 Help ARC evaluate capabilities of current language models (still need people)

Beth Barnes

5mo

6

90 RL with KL penalties is better seen as Bayesian inference

Tomek Korbak

6mo

15

88 Inverse Scaling Prize: Round 1 Winners

Ethan Perez

2mo

16

68 Language Model Alignment Research Internships

Ethan Perez

1y

1

56 Gears-Level Mental Models of Transformer Interpretability

KevinRoWang

8mo

4

53 Deep learning curriculum for large language model alignment

Jacob_Hilton

5mo

3

52 Paper: Large Language Models Can Self-improve [Linkpost]

Evan R. Murphy

2mo

14

46 NLP Position Paper: When Combatting Hype, Proceed with Caution

Sam Bowman

1y

15

40 The Problem With The Current State of AGI Definitions

Yitz

6mo

22

364 chinchilla's wild implications

nostalgebraist

4mo

114

166 Announcing the Inverse Scaling Prize ($250k Prize Pool)

Ethan Perez

5mo

14

83 Causal confusion as an argument against the scaling hypothesis

RobertKirk

6mo

30

79 Thoughts on the Alignment Implications of Scaling Language Models

leogao

1y

11

51 NVIDIA and Microsoft releases 530B parameter transformer model, Megatron-Turing NLG

Ozyrus

1y

36

50 [Link] Training Compute-Optimal Large Language Models

nostalgebraist

8mo

23

47 Parameter counts in Machine Learning

Jsevillamol

1y

16

47 Smoke without fire is scary

Adam Jermyn

2mo

22

27 Inverse scaling can become U-shaped

Edouard Harris

1mo

15

15 Updates on scaling laws for foundation models from 'Transcending Scaling Laws with 0.1% Extra Compute'

Nick_Greig

1mo

2