Go Back
Choose this branch
You can't go any further
meritocratic
regular
democratic
hot
top
alive
29 posts
Language Models
Definitions
PaLM
Prompt Engineering
Robotics
10 posts
Scaling Laws
164
Language models seem to be much better than humans at next-token prediction
Buck
4mo
56
142
Transformer Circuits
evhub
12mo
4
118
The case for becoming a black-box investigator of language models
Buck
7mo
19
112
Who models the models that model models? An exploration of GPT-3's in-context model fitting ability
Lovre
6mo
14
103
Testing PaLM prompts on GPT3
Yitz
8mo
15
94
Help ARC evaluate capabilities of current language models (still need people)
Beth Barnes
5mo
6
90
RL with KL penalties is better seen as Bayesian inference
Tomek Korbak
6mo
15
88
Inverse Scaling Prize: Round 1 Winners
Ethan Perez
2mo
16
68
Language Model Alignment Research Internships
Ethan Perez
1y
1
56
Gears-Level Mental Models of Transformer Interpretability
KevinRoWang
8mo
4
53
Deep learning curriculum for large language model alignment
Jacob_Hilton
5mo
3
52
Paper: Large Language Models Can Self-improve [Linkpost]
Evan R. Murphy
2mo
14
46
NLP Position Paper: When Combatting Hype, Proceed with Caution
Sam Bowman
1y
15
40
The Problem With The Current State of AGI Definitions
Yitz
6mo
22
364
chinchilla's wild implications
nostalgebraist
4mo
114
166
Announcing the Inverse Scaling Prize ($250k Prize Pool)
Ethan Perez
5mo
14
83
Causal confusion as an argument against the scaling hypothesis
RobertKirk
6mo
30
79
Thoughts on the Alignment Implications of Scaling Language Models
leogao
1y
11
51
NVIDIA and Microsoft releases 530B parameter transformer model, Megatron-Turing NLG
Ozyrus
1y
36
50
[Link] Training Compute-Optimal Large Language Models
nostalgebraist
8mo
23
47
Parameter counts in Machine Learning
Jsevillamol
1y
16
47
Smoke without fire is scary
Adam Jermyn
2mo
22
27
Inverse scaling can become U-shaped
Edouard Harris
1mo
15
15
Updates on scaling laws for foundation models from ' Transcending Scaling Laws with 0.1% Extra Compute'
Nick_Greig
1mo
2