Go Back
Choose this branch
Choose this branch
meritocratic
regular
democratic
hot
top
alive
57 posts
Language Models
Transformers
5 posts
Anthropic
Transformer Circuits
472
Simulators
janus
3mo
103
223
New Scaling Laws for Large Language Models
1a3orn
8mo
21
187
The case for aligning narrowly superhuman models
Ajeya Cotra
1y
74
164
Language models seem to be much better than humans at next-token prediction
Buck
4mo
56
142
Transformer Circuits
evhub
12mo
4
123
Did ChatGPT just gaslight me?
ThomasW
19d
45
112
Who models the models that model models? An exploration of GPT-3's in-context model fitting ability
Lovre
6mo
14
110
GPT-3 Catching Fish in Morse Code
Megan Kinniment
5mo
27
103
Testing PaLM prompts on GPT3
Yitz
8mo
15
96
Paper: Teaching GPT3 to express uncertainty in words
Owain_Evans
6mo
7
94
Help ARC evaluate capabilities of current language models (still need people)
Beth Barnes
5mo
6
90
RL with KL penalties is better seen as Bayesian inference
Tomek Korbak
6mo
15
88
Inverse Scaling Prize: Round 1 Winners
Ethan Perez
2mo
16
84
A one-question Turing test for GPT-3
Paul Crowley
11mo
23
81
A Summary Of Anthropic's First Paper
Sam Ringer
11mo
0
64
Toy Models of Superposition
evhub
3mo
2
16
Understanding the tensor product formulation in Transformer Circuits
Tom Lieberum
12mo
2
11
Mechanistic Interpretability for the MLP Layers (rough early thoughts)
MadHatter
12mo
2
5
Will research in AI risk jinx it? Consequences of training AI on AI risk arguments
Yann Dubois
1d
6