Go Back
Choose this branch
Choose this branch
meritocratic
regular
democratic
hot
top
alive
2595 posts
AI
AI Timelines
AI Takeoff
Interpretability (ML & AI)
Careers
Instrumental Convergence
Iterated Amplification
Corrigibility
Audio
Debate (AI safety technique)
Infra-Bayesianism
DeepMind
488 posts
GPT
Conjecture (org)
Art
Music
Machine Learning (ML)
Bounties & Prizes (active)
OpenAI
QURI
Language Models
Project Announcement
DALL-E
Meta-Humor
531
(My understanding of) What Everyone in Technical Alignment is Doing and Why
Thomas Larsen
3mo
83
446
A Mechanistic Interpretability Analysis of Grokking
Neel Nanda
4mo
39
436
How To Get Into Independent Research On Alignment/Agency
johnswentworth
1y
33
432
DeepMind alignment team opinions on AGI ruin arguments
Vika
4mo
34
404
Without specific countermeasures, the easiest path to transformative AI likely leads to AI takeover
Ajeya Cotra
5mo
89
394
Why I think strong general AI is coming soon
porby
2mo
126
373
We Choose To Align AI
johnswentworth
11mo
15
332
Two-year update on my personal AI timelines
Ajeya Cotra
4mo
60
331
What should you change in response to an "emergency"? And AI risk
AnnaSalamon
5mo
60
323
A challenge for AGI organizations, and a challenge for readers
Rob Bensinger
19d
30
314
Why Agent Foundations? An Overly Abstract Explanation
johnswentworth
9mo
54
310
Are we in an AI overhang?
Andy Jones
2y
109
291
Fun with +12 OOMs of Compute
Daniel Kokotajlo
1y
78
287
Don't die with dignity; instead play to your outs
Jeffrey Ladish
8mo
58
808
Simulators
janus
3mo
103
521
chinchilla's wild implications
nostalgebraist
4mo
114
415
What DALL-E 2 can and cannot do
Swimmer963
7mo
305
314
Jailbreaking ChatGPT on Release Day
Zvi
18d
74
267
We Are Conjecture, A New Alignment Research Startup
Connor Leahy
8mo
24
267
New Scaling Laws for Large Language Models
1a3orn
8mo
21
264
Common misconceptions about OpenAI
Jacob_Hilton
3mo
138
262
Mysteries of mode collapse
janus
1mo
35
234
Conjecture: a retrospective after 8 months of work
Connor Leahy
27d
9
234
Connor Leahy on Dying with Dignity, EleutherAI and Conjecture
Michaël Trazzi
5mo
29
220
Humans Who Are Not Concentrating Are Not General Intelligences
sarahconstantin
3y
35
213
Playing with DALL·E 2
Dave Orr
8mo
116
210
Hiring engineers and researchers to help align GPT-3
paulfchristiano
2y
14
202
Announcing the Inverse Scaling Prize ($250k Prize Pool)
Ethan Perez
5mo
14