29 posts
Language Models
Definitions
PaLM
Prompt Engineering
Robotics
10 posts
Scaling Laws
165
Language models seem to be much better than humans at next-token prediction
Buck
4mo
56
140
Who models the models that model models? An exploration of GPT-3's in-context model fitting ability
Lovre
6mo
14
127
Transformer Circuits
evhub
12mo
4
123
The case for becoming a black-box investigator of language models
Buck
7mo
19
116
RL with KL penalties is better seen as Bayesian inference
Tomek Korbak
6mo
15
101
Inverse Scaling Prize: Round 1 Winners
Ethan Perez
2mo
16
100
Testing PaLM prompts on GPT3
Yitz
8mo
15
89
Help ARC evaluate capabilities of current language models (still need people)
Beth Barnes
5mo
6
75
Language Model Alignment Research Internships
Ethan Perez
1y
1
70
Gears-Level Mental Models of Transformer Interpretability
KevinRoWang
8mo
4
64
Paper: Large Language Models Can Self-improve [Linkpost]
Evan R. Murphy
2mo
14
61
Deep learning curriculum for large language model alignment
Jacob_Hilton
5mo
3
58
Conditioning Generative Models for Alignment
Jozdien
5mo
8
50
NLP Position Paper: When Combatting Hype, Proceed with Caution
Sam Bowman
1y
15
494
chinchilla's wild implications
nostalgebraist
4mo
114
191
Announcing the Inverse Scaling Prize ($250k Prize Pool)
Ethan Perez
5mo
14
101
Causal confusion as an argument against the scaling hypothesis
RobertKirk
6mo
30
80
Thoughts on the Alignment Implications of Scaling Language Models
leogao
1y
11
60
Parameter counts in Machine Learning
Jsevillamol
1y
16
54
[Link] Training Compute-Optimal Large Language Models
nostalgebraist
8mo
23
52
NVIDIA and Microsoft releases 530B parameter transformer model, Megatron-Turing NLG
Ozyrus
1y
36
49
Smoke without fire is scary
Adam Jermyn
2mo
22
26
Inverse scaling can become U-shaped
Edouard Harris
1mo
15
25
Updates on scaling laws for foundation models from 'Transcending Scaling Laws with 0.1% Extra Compute'
Nick_Greig
1mo
2