Tree of Tags

Go Back

Choose this branch

You can't go any further

meritocratic regular democratic

hot top alive

24 posts Language Models Robotics

0 posts

27 Discovering Language Model Behaviors with Model-Written Evaluations

evhub

4h

3

29 Take 11: "Aligning language models" should be weirder.

Charlie Steiner

2d

0

164 Language models seem to be much better than humans at next-token prediction

Buck

4mo

56

88 Inverse Scaling Prize: Round 1 Winners

Ethan Perez

2mo

16

52 Paper: Large Language Models Can Self-improve [Linkpost]

Evan R. Murphy

2mo

14

94 Help ARC evaluate capabilities of current language models (still need people)

Beth Barnes

5mo

6

112 Who models the models that model models? An exploration of GPT-3's in-context model fitting ability

Lovre

6mo

14

90 RL with KL penalties is better seen as Bayesian inference

Tomek Korbak

6mo

15

142 Transformer Circuits

evhub

12mo

4

53 Deep learning curriculum for large language model alignment

Jacob_Hilton

5mo

3

28 Strategy For Conditioning Generative Models

james.lucassen

3mo

4

40 Conditioning Generative Models for Alignment

Jozdien

5mo

8

56 Gears-Level Mental Models of Transformer Interpretability

KevinRoWang

8mo

4

68 Language Model Alignment Research Internships

Ethan Perez

1y

1