3 posts
Project Announcement
Encultured AI (org)
19 posts
Conjecture (org)
105
Announcing Encultured AI: Building a Video Game
Andrew_Critch
4mo
26
32
Encultured AI Pre-planning, Part 2: Providing a Service
Andrew_Critch
4mo
4
70
Announcing the Vitalik Buterin Fellowships in AI Existential Safety!
DanielFilan
1y
2
64
[Interim research report] Taking features out of superposition with sparse autoencoders
Lee Sharkey
7d
10
143
Conjecture: a retrospective after 8 months of work
Connor Leahy
27d
9
96
The Singular Value Decompositions of Transformer Weight Matrices are Highly Interpretable
beren
22d
27
108
What I Learned Running Refine
adamShimi
26d
5
178
Mysteries of mode collapse
janus
1mo
35
52
Conjecture Second Hiring Round
Connor Leahy
27d
0
31
Searching for Search
NicholasKees
22d
6
41
Current themes in mechanistic interpretability research
Lee Sharkey
1mo
3
56
Interpreting Neural Networks through the Polytope Lens
Sid Black
2mo
26
68
How to Diversify Conceptual Alignment: the Model Behind Refine
adamShimi
5mo
11
118
We Are Conjecture, A New Alignment Research Startup
Connor Leahy
8mo
24
61
Circumventing interpretability: How to defeat mind-readers
Lee Sharkey
5mo
8
47
Refine's First Blog Post Day
adamShimi
4mo
3
41
Abstracting The Hardness of Alignment: Unbounded Atomic Optimization
adamShimi
4mo
3