Tree of Tags

1854 posts AI AI Sentience Truthful AI

1 post

84 Towards Hodge-podge Alignment

Cleo Nardo

1d

20

198 The next decades might be wild

Marius Hobbhahn

5d

21

6 I believe some AI doomers are overconfident

FTPickle

6h

4

41 The "Minimal Latents" Approach to Natural Abstractions

johnswentworth

22h

6

52 Existential AI Safety is NOT separate from near-term applications

scasper

7d

15

11 Will Machines Ever Rule the World? MLAISU W50

Esben Kran

4d

4

89 Trying to disambiguate different questions about whether RLHF is “good”

Buck

6d

39

282 AGI Safety FAQ / all-dumb-questions-allowed thread

Aryeh Englander

6mo

514

19 Why mechanistic interpretability does not and cannot contribute to long-term AGI safety (from messages with a friend)

Remmelt

1d

6

190 Using GPT-Eliezer against ChatGPT Jailbreaking

Stuart_Armstrong

14d

77

25 If Wentworth is right about natural abstractions, it would be bad for alignment

Wuschel Schulz

12d

5

111 Revisiting algorithmic progress

Tamay

7d

6

74 Predicting GPU performance

Marius Hobbhahn

6d

24

35 Is the AI timeline too short to have children?

Yoreth

6d

20

15 Truthfulness, standards and credibility

Joe_Collman

8mo

2