Go Back
Choose this branch
Choose this branch
meritocratic
regular
democratic
hot
top
alive
22 posts
Outer Alignment
Mesa-Optimization
20 posts
Neuroscience
Neuromorphic AI
Predictive Processing
Neocortex
Computing Overhang
Planning & Decision-Making
Hansonian Pre-Rationality
Intentionality
Emergent Behavior ( Emergence )
56
Paper: Constitutional AI: Harmlessness from AI Feedback (Anthropic)
LawrenceC
4d
10
65
Human Mimicry Mainly Works When We’re Already Close
johnswentworth
4mo
16
52
Agency As a Natural Abstraction
Thane Ruthenis
7mo
9
184
Risks from Learned Optimization: Introduction
evhub
3y
42
52
Meta learning to gradient hack
Quintin Pope
1y
11
8
Inner alignment: what are we pointing at?
lcmgcd
3mo
2
24
[ASoT] Some thoughts about deceptive mesaoptimization
leogao
8mo
5
56
Formal Solution to the Inner Alignment Problem
michaelcohen
1y
123
5
Planning capacity and daemons
lcmgcd
2mo
0
44
"Inner Alignment Failures" Which Are Actually Outer Alignment Failures
johnswentworth
2y
38
75
Conditions for Mesa-Optimization
evhub
3y
48
71
Risks from Learned Optimization: Conclusion and Related Work
evhub
3y
4
22
Thoughts on gradient hacking
Richard_Ngo
1y
12
58
[AN #58] Mesa optimization: what it is, and why we should care
Rohin Shah
3y
9
47
My take on Jacob Cannell’s take on AGI safety
Steven Byrnes
22d
13
174
My computational framework for the brain
Steven Byrnes
2y
26
131
Inner Alignment in Salt-Starved Rats
Steven Byrnes
2y
39
143
Matt Botvinick on the spontaneous emergence of learning algorithms
Adam Scholl
2y
87
43
[Intro to brain-like-AGI safety] 2. “Learning from scratch” in the brain
Steven Byrnes
10mo
12
97
Book review: "A Thousand Brains" by Jeff Hawkins
Steven Byrnes
1y
18
36
[Intro to brain-like-AGI safety] 8. Takeaways from neuro 1/2: On AGI development
Steven Byrnes
9mo
2
51
Brain-inspired AGI and the "lifetime anchor"
Steven Byrnes
1y
16
45
Value loading in the human brain: a worked example
Steven Byrnes
1y
2
78
How uniform is the neocortex?
zhukeepa
2y
23
79
Inner alignment in the brain
Steven Byrnes
2y
16
47
What Decision Theory is Implied By Predictive Processing?
johnswentworth
2y
17
60
Predictive coding = RL + SL + Bayes + MPC
Steven Byrnes
3y
8
49
Building brain-inspired AGI is infinitely easier than understanding the brain
Steven Byrnes
2y
14