3 posts
Project Announcement
Encultured AI (org)
19 posts
Conjecture (org)
103
Announcing Encultured AI: Building a Video Game
Andrew_Critch
4mo
26
33
Encultured AI Pre-planning, Part 2: Providing a Service
Andrew_Critch
4mo
4
64
Announcing the Vitalik Buterin Fellowships in AI Existential Safety!
DanielFilan
1y
2
80
[Interim research report] Taking features out of superposition with sparse autoencoders
Lee Sharkey
7d
10
159
The Singular Value Decompositions of Transformer Weight Matrices are Highly Interpretable
beren
22d
27
183
Conjecture: a retrospective after 8 months of work
Connor Leahy
27d
9
213
Mysteries of mode collapse
janus
1mo
35
103
What I Learned Running Refine
adamShimi
26d
5
85
Conjecture Second Hiring Round
Connor Leahy
27d
0
64
Searching for Search
NicholasKees
22d
6
82
Current themes in mechanistic interpretability research
Lee Sharkey
1mo
3
123
Interpreting Neural Networks through the Polytope Lens
Sid Black
2mo
26
186
We Are Conjecture, A New Alignment Research Startup
Connor Leahy
8mo
24
94
Circumventing interpretability: How to defeat mind-readers
Lee Sharkey
5mo
8
78
How to Diversify Conceptual Alignment: the Model Behind Refine
adamShimi
5mo
11
123
Refine: An Incubator for Conceptual Alignment Research Bets
adamShimi
8mo
13
55
Refine's First Blog Post Day
adamShimi
4mo
3