3 posts
Project Announcement
Encultured AI (org)
19 posts
Conjecture (org)
103
Announcing Encultured AI: Building a Video Game
Andrew_Critch
4mo
26
64
Announcing the Vitalik Buterin Fellowships in AI Existential Safety!
DanielFilan
1y
2
33
Encultured AI Pre-planning, Part 2: Providing a Service
Andrew_Critch
4mo
4
213
Mysteries of mode collapse
janus
1mo
35
186
We Are Conjecture, A New Alignment Research Startup
Connor Leahy
8mo
24
183
Conjecture: a retrospective after 8 months of work
Connor Leahy
27d
9
159
The Singular Value Decompositions of Transformer Weight Matrices are Highly Interpretable
beren
22d
27
123
Refine: An Incubator for Conceptual Alignment Research Bets
adamShimi
8mo
13
123
Interpreting Neural Networks through the Polytope Lens
Sid Black
2mo
26
103
What I Learned Running Refine
adamShimi
26d
5
94
Circumventing interpretability: How to defeat mind-readers
Lee Sharkey
5mo
8
85
Conjecture Second Hiring Round
Connor Leahy
27d
0
82
Current themes in mechanistic interpretability research
Lee Sharkey
1mo
3
80
[Interim research report] Taking features out of superposition with sparse autoencoders
Lee Sharkey
7d
10
78
How to Diversify Conceptual Alignment: the Model Behind Refine
adamShimi
5mo
11
64
Searching for Search
NicholasKees
22d
6
62
Abstracting The Hardness of Alignment: Unbounded Atomic Optimization
adamShimi
4mo
3