Tree of Tags

Go Back

Choose this branch

Choose this branch

meritocratic regular democratic

hot top alive

128 posts GPT Bounties & Prizes (active) QURI AI Safety Public Materials Squiggle Generativity

112 posts Conjecture (org) Language Models Refine Project Announcement Anthropic Exploratory Engineering Encultured AI (org) Transformer Circuits Transformers

70 Bad at Arithmetic, Promising at Math

cohenmacaulay

2d

17

43 Next Level Seinfeld

Zvi

1d

6

160 Jailbreaking ChatGPT on Release Day

Zvi

18d

74

27 A crisis for online communication: bots and bot users will overrun the Internet?

Mitchell_Porter

9d

11

11 Does ChatGPT’s performance warrant working on a tutor for children? [It’s time to take it to the lab.]

Bill Benzon

1d

2

11 [LINK] - ChatGPT discussion

JanBrauner

19d

7

-23 Could an AI be Religious?

mk54

16d

14

10 Is the ChatGPT-simulated Linux virtual machine real?

Kenoubi

7d

7

2 ChatGPT: "An error occurred. If this issue persists..."

Bill Benzon

13d

11

9 What is the best article to introduce someone to AI safety for the first time?

Trevor1

28d

7

2 Best introductory overviews of AGI safety?

Jakub Kraus

7d

5

45 Announcing AI Alignment Awards: $100k research contests about goal misgeneralization & corrigibility

Akash

28d

20

13 ChatGPT is surprisingly and uncanningly good at pretending to be sentient

ZT5

17d

11

4 Trivial GPT-3.5 limitation workaround

Dave Lindbergh

8d

4

27 Discovering Language Model Behaviors with Model-Written Evaluations

evhub

4h

3

3 Will research in AI risk jinx it? Consequences of training AI on AI risk arguments

Yann Dubois

1d

6

98 Did ChatGPT just gaslight me?

ThomasW

19d

45

59 [Interim research report] Taking features out of superposition with sparse autoencoders

Lee Sharkey

7d

10

164 Mysteries of mode collapse

janus

1mo

35

136 Simulators

janus

3mo

103

17 Shh, don't tell the AI it's likely to be evil

naterush

14d

9

-4 Simulators and Mindcrime

DragonGod

11d

4

23 Does a LLM have a utility function?

Dagon

11d

6

17 Gliders in Language Models

Alexandre Variengien

25d

11

55 A brainteaser for language models

Adam Scherlis

8d

3

155 Language models seem to be much better than humans at next-token prediction

Buck

4mo

56

132 Conjecture: a retrospective after 8 months of work

Connor Leahy

27d

9

66 Refine: An Incubator for Conceptual Alignment Research Bets

adamShimi

8mo

13