Tree of Tags

Go Back

Choose this branch

You can't go any further

meritocratic regular democratic

hot top alive

3 posts Project Announcement Encultured AI (org)

19 posts Conjecture (org)

105 Announcing Encultured AI: Building a Video Game

Andrew_Critch

4mo

26

32 Encultured AI Pre-planning, Part 2: Providing a Service

Andrew_Critch

4mo

4

70 Announcing the Vitalik Buterin Fellowships in AI Existential Safety!

DanielFilan

1y

2

64 [Interim research report] Taking features out of superposition with sparse autoencoders

Lee Sharkey

7d

10

143 Conjecture: a retrospective after 8 months of work

Connor Leahy

27d

9

96 The Singular Value Decompositions of Transformer Weight Matrices are Highly Interpretable

beren

22d

27

108 What I Learned Running Refine

adamShimi

26d

5

178 Mysteries of mode collapse

janus

1mo

35

52 Conjecture Second Hiring Round

Connor Leahy

27d

0

31 Searching for Search

NicholasKees

22d

6

41 Current themes in mechanistic interpretability research

Lee Sharkey

1mo

3

56 Interpreting Neural Networks through the Polytope Lens

Sid Black

2mo

26

68 How to Diversify Conceptual Alignment: the Model Behind Refine

adamShimi

5mo

11

118 We Are Conjecture, A New Alignment Research Startup

Connor Leahy

8mo

24

61 Circumventing interpretability: How to defeat mind-readers

Lee Sharkey

5mo

8

47 Refine's First Blog Post Day

adamShimi

4mo

3

41 Abstracting The Hardness of Alignment: Unbounded Atomic Optimization

adamShimi

4mo

3