42 posts
Outer Alignment
Mesa-Optimization
Neuroscience
Neuromorphic AI
Predictive Processing
Neocortex
Computing Overhang
Planning & Decision-Making
Intentionality
Hansonian Pre-Rationality
Emergent Behavior (Emergence)
29 posts
Optimization
General Intelligence
AI Services (CAIS)
Selection vs Control
Distinctions
Adaptation Executors
Narrow AI
World Modeling Techniques
60
Paper: Constitutional AI: Harmlessness from AI Feedback (Anthropic)
LawrenceC
4d
10
61
My take on Jacob Cannell’s take on AGI safety
Steven Byrnes
22d
13
68
Human Mimicry Mainly Works When We’re Already Close
johnswentworth
4mo
16
55
Agency As a Natural Abstraction
Thane Ruthenis
7mo
9
136
Inner Alignment in Salt-Starved Rats
Steven Byrnes
2y
39
144
My computational framework for the brain
Steven Byrnes
2y
26
110
Book review: "A Thousand Brains" by Jeff Hawkins
Steven Byrnes
1y
18
41
[Intro to brain-like-AGI safety] 8. Takeaways from neuro 1/2: On AGI development
Steven Byrnes
9mo
2
147
Matt Botvinick on the spontaneous emergence of learning algorithms
Adam Scholl
2y
87
64
Brain-inspired AGI and the "lifetime anchor"
Steven Byrnes
1y
16
43
[Intro to brain-like-AGI safety] 2. “Learning from scratch” in the brain
Steven Byrnes
10mo
12
54
Meta learning to gradient hack
Quintin Pope
1y
11
166
Risks from Learned Optimization: Introduction
evhub
3y
42
24
[ASoT] Some thoughts about deceptive mesaoptimization
leogao
8mo
5
37
Don't align agents to evaluations of plans
TurnTrout
24d
46
14
Take 6: CAIS is actually Orwellian.
Charlie Steiner
13d
5
103
What's General-Purpose Search, And Why Might We Expect To See It In Trained ML Systems?
johnswentworth
4mo
15
52
Humans aren't fitness maximizers
So8res
2mo
45
217
The ground of optimization
Alex Flint
2y
74
51
Ngo and Yudkowsky on scientific reasoning and pivotal acts
Eliezer Yudkowsky
10mo
13
68
Optimization Concepts in the Game of Life
Vika
1y
15
26
Bits of Optimization Can Only Be Lost Over A Distance
johnswentworth
7mo
15
47
Measurement, Optimization, and Take-off Speed
jsteinhardt
1y
4
139
Selection vs Control
abramdemski
3y
25
58
Reflective Bayesianism
abramdemski
1y
27
30
Understanding Gradient Hacking
peterbarnett
1y
5
78
How special are human brains among animal brains?
zhukeepa
2y
38
118
Reframing Superintelligence: Comprehensive AI Services as General Intelligence
Rohin Shah
3y
75