Go Back
You can't go any further
You can't go any further
meritocratic
regular
democratic
hot
top
alive
22 posts
Conjecture (org)
12 posts
Refine
59
[Interim research report] Taking features out of superposition with sparse autoencoders
Lee Sharkey
7d
10
164
Mysteries of mode collapse
janus
1mo
35
132
Conjecture: a retrospective after 8 months of work
Connor Leahy
27d
9
66
Refine: An Incubator for Conceptual Alignment Research Bets
adamShimi
8mo
13
25
Searching for Search
NicholasKees
22d
6
42
The First Filter
adamShimi
24d
5
103
What I Learned Running Refine
adamShimi
26d
5
101
Understanding Conjecture: Notes from Connor Leahy interview
Akash
3mo
24
35
AMA Conjecture, A New Alignment Startup
adamShimi
8mo
42
118
Connor Leahy on Dying with Dignity, EleutherAI and Conjecture
Michaƫl Trazzi
5mo
29
34
Current themes in mechanistic interpretability research
Lee Sharkey
1mo
3
63
How to Diversify Conceptual Alignment: the Model Behind Refine
adamShimi
5mo
11
43
Refine's First Blog Post Day
adamShimi
4mo
3
36
Abstracting The Hardness of Alignment: Unbounded Atomic Optimization
adamShimi
4mo
3
22
confusion about alignment requirements
carado
2mo
10
32
All the posts I will never write
Alexander Gietelink Oldenziel
4mo
8
45
I missed the crux of the alignment problem the whole time
zeshen
4mo
7
39
the Insulated Goal-Program idea
carado
4mo
3
22
Embedding safety in ML development
zeshen
1mo
1
12
Benchmarking Proposals on Risk Scenarios
Paul Bricman
4mo
2
23
goal-program bricks
carado
4mo
2
12
Refine's Third Blog Post Day/Week
adamShimi
3mo
0
23
Representational Tethers: Tying AI Latents To Human Ones
Paul Bricman
3mo
0
11
Refine Blogpost Day #3: The shortforms I did write
Alexander Gietelink Oldenziel
3mo
0
9
Boolean Primitives for Coupled Optimizers
Paul Bricman
2mo
0
24
(Structural) Stability of Coupled Optimizers
Paul Bricman
2mo
0