128 posts
GPT
Bounties & Prizes (active)
QURI
AI Safety Public Materials
Squiggle
Generativity
112 posts
Conjecture (org)
Language Models
Refine
Project Announcement
Anthropic
Exploratory Engineering
Encultured AI (org)
Transformer Circuits
Transformers
43
Next Level Seinfeld
Zvi
1d
6
70
Bad at Arithmetic, Promising at Math
cohenmacaulay
2d
17
11
Does ChatGPT’s performance warrant working on a tutor for children? [It’s time to take it to the lab.]
Bill Benzon
1d
2
160
Jailbreaking ChatGPT on Release Day
Zvi
18d
74
27
A crisis for online communication: bots and bot users will overrun the Internet?
Mitchell_Porter
9d
11
10
Is the ChatGPT-simulated Linux virtual machine real?
Kenoubi
7d
7
45
Announcing AI Alignment Awards: $100k research contests about goal misgeneralization & corrigibility
Akash
28d
20
96
AI Timelines via Cumulative Optimization Power: Less Long, More Short
jacob_cannell
2mo
32
19
ChatGPT: First Impressions
specbug
19d
2
13
ChatGPT is surprisingly and uncanningly good at pretending to be sentient
ZT5
17d
11
75
Announcing $5,000 bounty for (responsibly) ending malaria
lc
2mo
42
11
[LINK] - ChatGPT discussion
JanBrauner
19d
7
34
Prizes for ML Safety Benchmark Ideas
joshc
1mo
3
130
Announcing the Inverse Scaling Prize ($250k Prize Pool)
Ethan Perez
5mo
14
27
Discovering Language Model Behaviors with Model-Written Evaluations
evhub
4h
3
26
Take 11: "Aligning language models" should be weirder.
Charlie Steiner
2d
0
59
[Interim research report] Taking features out of superposition with sparse autoencoders
Lee Sharkey
7d
10
55
A brainteaser for language models
Adam Scherlis
8d
3
38
Discovering Latent Knowledge in Language Models Without Supervision
Xodarap
6d
1
98
Did ChatGPT just gaslight me?
ThomasW
19d
45
132
Conjecture: a retrospective after 8 months of work
Connor Leahy
27d
9
3
Will research in AI risk jinx it? Consequences of training AI on AI risk arguments
Yann Dubois
1d
6
103
What I Learned Running Refine
adamShimi
26d
5
164
Mysteries of mode collapse
janus
1mo
35
22
Tradeoffs in complexity, abstraction, and generality
remember
8d
0
16
An exploration of GPT-2's embedding weights
Adam Scherlis
7d
2
23
Does a LLM have a utility function?
Dagon
11d
6
42
The First Filter
adamShimi
24d
5