Tree of Tags

Go Back

Choose this branch

You can't go any further

meritocratic regular democratic

hot top alive

29 posts Language Models Definitions PaLM Prompt Engineering Robotics

10 posts Scaling Laws

165 Language models seem to be much better than humans at next-token prediction

Buck

4mo

56

140 Who models the models that model models? An exploration of GPT-3's in-context model fitting ability

Lovre

6mo

14

127 Transformer Circuits

evhub

12mo

4

123 The case for becoming a black-box investigator of language models

Buck

7mo

19

116 RL with KL penalties is better seen as Bayesian inference

Tomek Korbak

6mo

15

101 Inverse Scaling Prize: Round 1 Winners

Ethan Perez

2mo

16

100 Testing PaLM prompts on GPT3

Yitz

8mo

15

89 Help ARC evaluate capabilities of current language models (still need people)

Beth Barnes

5mo

6

75 Language Model Alignment Research Internships

Ethan Perez

1y

1

70 Gears-Level Mental Models of Transformer Interpretability

KevinRoWang

8mo

4

64 Paper: Large Language Models Can Self-improve [Linkpost]

Evan R. Murphy

2mo

14

61 Deep learning curriculum for large language model alignment

Jacob_Hilton

5mo

3

58 Conditioning Generative Models for Alignment

Jozdien

5mo

8

50 NLP Position Paper: When Combatting Hype, Proceed with Caution

Sam Bowman

1y

15

494 chinchilla's wild implications

nostalgebraist

4mo

114

191 Announcing the Inverse Scaling Prize ($250k Prize Pool)

Ethan Perez

5mo

14

101 Causal confusion as an argument against the scaling hypothesis

RobertKirk

6mo

30

80 Thoughts on the Alignment Implications of Scaling Language Models

leogao

1y

11

60 Parameter counts in Machine Learning

Jsevillamol

1y

16

54 [Link] Training Compute-Optimal Large Language Models

nostalgebraist

8mo

23

52 NVIDIA and Microsoft releases 530B parameter transformer model, Megatron-Turing NLG

Ozyrus

1y

36

49 Smoke without fire is scary

Adam Jermyn

2mo

22

26 Inverse scaling can become U-shaped

Edouard Harris

1mo

15

25 Updates on scaling laws for foundation models from ' Transcending Scaling Laws with 0.1% Extra Compute'

Nick_Greig

1mo

2