Tree of Tags

Go Back

Choose this branch

You can't go any further

meritocratic regular democratic

hot top alive

3 posts Project Announcement Encultured AI (org)

19 posts Conjecture (org)

103 Announcing Encultured AI: Building a Video Game

Andrew_Critch

4mo

26

33 Encultured AI Pre-planning, Part 2: Providing a Service

Andrew_Critch

4mo

4

64 Announcing the Vitalik Buterin Fellowships in AI Existential Safety!

DanielFilan

1y

2

80 [Interim research report] Taking features out of superposition with sparse autoencoders

Lee Sharkey

7d

10

159 The Singular Value Decompositions of Transformer Weight Matrices are Highly Interpretable

beren

22d

27

183 Conjecture: a retrospective after 8 months of work

Connor Leahy

27d

9

213 Mysteries of mode collapse

janus

1mo

35

103 What I Learned Running Refine

adamShimi

26d

5

85 Conjecture Second Hiring Round

Connor Leahy

27d

0

64 Searching for Search

NicholasKees

22d

6

82 Current themes in mechanistic interpretability research

Lee Sharkey

1mo

3

123 Interpreting Neural Networks through the Polytope Lens

Sid Black

2mo

26

186 We Are Conjecture, A New Alignment Research Startup

Connor Leahy

8mo

24

94 Circumventing interpretability: How to defeat mind-readers

Lee Sharkey

5mo

8

78 How to Diversify Conceptual Alignment: the Model Behind Refine

adamShimi

5mo

11

123 Refine: An Incubator for Conceptual Alignment Research Bets

adamShimi

8mo

13

55 Refine's First Blog Post Day

adamShimi

4mo

3