24 posts
Language Models
27 | Discovering Language Model Behaviors with Model-Written Evaluations | evhub | 4h | 3
29 | Take 11: "Aligning language models" should be weirder. | Charlie Steiner | 2d | 0
164 | Language models seem to be much better than humans at next-token prediction | Buck | 4mo | 56
88 | Inverse Scaling Prize: Round 1 Winners | Ethan Perez | 2mo | 16
52 | Paper: Large Language Models Can Self-improve [Linkpost] | Evan R. Murphy | 2mo | 14
94 | Help ARC evaluate capabilities of current language models (still need people) | Beth Barnes | 5mo | 6
112 | Who models the models that model models? An exploration of GPT-3's in-context model fitting ability | Lovre | 6mo | 14
90 | RL with KL penalties is better seen as Bayesian inference | Tomek Korbak | 6mo | 15
142 | Transformer Circuits | evhub | 12mo | 4
53 | Deep learning curriculum for large language model alignment | Jacob_Hilton | 5mo | 3
28 | Strategy For Conditioning Generative Models | james.lucassen | 3mo | 4
40 | Conditioning Generative Models for Alignment | Jozdien | 5mo | 8
56 | Gears-Level Mental Models of Transformer Interpretability | KevinRoWang | 8mo | 4
68 | Language Model Alignment Research Internships | Ethan Perez | 1y | 1