Tree of Tags

Go Back

Choose this branch

You can't go any further

meritocratic regular democratic

hot top alive

3 posts Project Announcement Encultured AI (org)

19 posts Conjecture (org)

105 Announcing Encultured AI: Building a Video Game

Andrew_Critch

4mo

26

70 Announcing the Vitalik Buterin Fellowships in AI Existential Safety!

DanielFilan

1y

2

32 Encultured AI Pre-planning, Part 2: Providing a Service

Andrew_Critch

4mo

4

178 Mysteries of mode collapse

janus

1mo

35

143 Conjecture: a retrospective after 8 months of work

Connor Leahy

27d

9

118 We Are Conjecture, A New Alignment Research Startup

Connor Leahy

8mo

24

108 What I Learned Running Refine

adamShimi

26d

5

96 The Singular Value Decompositions of Transformer Weight Matrices are Highly Interpretable

beren

22d

27

76 Refine: An Incubator for Conceptual Alignment Research Bets

adamShimi

8mo

13

68 How to Diversify Conceptual Alignment: the Model Behind Refine

adamShimi

5mo

11

64 [Interim research report] Taking features out of superposition with sparse autoencoders

Lee Sharkey

7d

10

61 Circumventing interpretability: How to defeat mind-readers

Lee Sharkey

5mo

8

56 Interpreting Neural Networks through the Polytope Lens

Sid Black

2mo

26

52 Conjecture Second Hiring Round

Connor Leahy

27d

0

47 Refine's First Blog Post Day

adamShimi

4mo

3

41 Current themes in mechanistic interpretability research

Lee Sharkey

1mo

3

41 Abstracting The Hardness of Alignment: Unbounded Atomic Optimization

adamShimi

4mo

3