1854 posts
AI
AI Sentience
Truthful AI
1 post
84 | Towards Hodge-podge Alignment | Cleo Nardo | 1d | 20
198 | The next decades might be wild | Marius Hobbhahn | 5d | 21
6 | I believe some AI doomers are overconfident | FTPickle | 6h | 4
41 | The "Minimal Latents" Approach to Natural Abstractions | johnswentworth | 22h | 6
52 | Existential AI Safety is NOT separate from near-term applications | scasper | 7d | 15
11 | Will Machines Ever Rule the World? MLAISU W50 | Esben Kran | 4d | 4
89 | Trying to disambiguate different questions about whether RLHF is “good” | Buck | 6d | 39
282 | AGI Safety FAQ / all-dumb-questions-allowed thread | Aryeh Englander | 6mo | 514
19 | Why mechanistic interpretability does not and cannot contribute to long-term AGI safety (from messages with a friend) | Remmelt | 1d | 6
190 | Using GPT-Eliezer against ChatGPT Jailbreaking | Stuart_Armstrong | 14d | 77
25 | If Wentworth is right about natural abstractions, it would be bad for alignment | Wuschel Schulz | 12d | 5
111 | Revisiting algorithmic progress | Tamay | 7d | 6
74 | Predicting GPU performance | Marius Hobbhahn | 6d | 24
35 | Is the AI timeline too short to have children? | Yoreth | 6d | 20
15 | Truthfulness, standards and credibility | Joe_Collman | 8mo | 2