128 posts
GPT
Bounties & Prizes (active)
QURI
AI Safety Public Materials
Squiggle
Generativity
112 posts
Conjecture (org)
Language Models
Refine
Project Announcement
Anthropic
Exploratory Engineering
Encultured AI (org)
Transformer Circuits
Transformers
45
Next Level Seinfeld
Zvi
1d
6
91
Bad at Arithmetic, Promising at Math
cohenmacaulay
2d
17
13
Does ChatGPT’s performance warrant working on a tutor for children? [It’s time to take it to the lab.]
Bill Benzon
1d
2
237
Jailbreaking ChatGPT on Release Day
Zvi
18d
74
18
Is the ChatGPT-simulated Linux virtual machine real?
Kenoubi
7d
7
23
A crisis for online communication: bots and bot users will overrun the Internet?
Mitchell_Porter
9d
11
69
Announcing AI Alignment Awards: $100k research contests about goal misgeneralization & corrigibility
Akash
28d
20
14
Best introductory overviews of AGI safety?
Jakub Kraus
7d
5
31
[ASoT] Finetuning, RL, and GPT's world prior
Jozdien
18d
8
111
AI Timelines via Cumulative Optimization Power: Less Long, More Short
jacob_cannell
2mo
32
111
Announcing $5,000 bounty for (responsibly) ending malaria
lc
2mo
42
17
ChatGPT is surprisingly and uncanningly good at pretending to be sentient
ZT5
17d
11
18
ChatGPT: First Impressions
specbug
19d
2
7
High level discourse structure in ChatGPT: Part 2 [Quasi-symbolic?]
Bill Benzon
10d
0
27
Discovering Language Model Behaviors with Model-Written Evaluations
evhub
4h
3
29
Take 11: "Aligning language models" should be weirder.
Charlie Steiner
2d
0
80
[Interim research report] Taking features out of superposition with sparse autoencoders
Lee Sharkey
7d
10
45
Discovering Latent Knowledge in Language Models Without Supervision
Xodarap
6d
1
5
Will research in AI risk jinx it? Consequences of training AI on AI risk arguments
Yann Dubois
1d
6
183
Conjecture: a retrospective after 8 months of work
Connor Leahy
27d
9
123
Did ChatGPT just gaslight me?
ThomasW
19d
45
46
A brainteaser for language models
Adam Scherlis
8d
3
213
Mysteries of mode collapse
janus
1mo
35
103
What I Learned Running Refine
adamShimi
26d
5
26
An exploration of GPT-2's embedding weights
Adam Scherlis
7d
2
27
Tradeoffs in complexity, abstraction, and generality
remember
8d
0
472
Simulators
janus
3mo
103
85
Conjecture Second Hiring Round
Connor Leahy
27d
0