Go Back
Choose this branch
Choose this branch
meritocratic
regular
democratic
hot
top
alive
76 posts
Inner Alignment
Outer Alignment
Mesa-Optimization
78 posts
Neuroscience
Predictive Processing
Neuromorphic AI
Brain-Computer Interfaces
Neocortex
Neuralink
Systems Thinking
Emergent Behavior ( Emergence )
61
Paper: Constitutional AI: Harmlessness from AI Feedback (Anthropic)
LawrenceC
4d
10
84
Inner and outer alignment decompose one hard problem into two extremely hard problems
TurnTrout
18d
18
41
Mesa-Optimizers via Grokking
orthonormal
14d
4
28
Take 8: Queer the inner/outer alignment dichotomy.
Charlie Steiner
11d
2
81
Trying to Make a Treacherous Mesa-Optimizer
MadHatter
1mo
13
19
Value Formation: An Overarching Model
Thane Ruthenis
1mo
6
60
How likely is deceptive alignment?
evhub
3mo
21
22
Greed Is the Root of This Evil
Thane Ruthenis
2mo
4
36
Broad Picture of Human Values
Thane Ruthenis
4mo
5
42
Outer vs inner misalignment: three framings
Richard_Ngo
5mo
4
94
Selection Theorems: A Program For Understanding Agents
johnswentworth
1y
23
165
Inner Alignment: Explain like I'm 12 Edition
Rafael Harth
2y
46
38
[Intro to brain-like-AGI safety] 10. The alignment problem
Steven Byrnes
8mo
4
32
The Speed + Simplicity Prior is probably anti-deceptive
7mo
29
32
Predictive Processing, Heterosexuality and Delusions of Grandeur
lsusr
3d
2
72
My take on Jacob Cannell’s take on AGI safety
Steven Byrnes
22d
13
29
[Hebbian Natural Abstractions] Introduction
Samuel Nellessen
29d
3
25
Unpacking "Shard Theory" as Hunch, Question, Theory, and Insight
Jacy Reese Anthis
1mo
8
31
AI researchers announce NeuroAI agenda
Cameron Berg
1mo
12
34
Quick notes on “mirror neurons”
Steven Byrnes
2mo
2
40
On oxytocin-sensitive neurons in auditory cortex
Steven Byrnes
3mo
6
128
Predictive Coding has been Unified with Backpropagation
lsusr
1y
44
119
Book review: "A Thousand Brains" by Jeff Hawkins
Steven Byrnes
1y
18
44
[Intro to brain-like-AGI safety] 8. Takeaways from neuro 1/2: On AGI development
Steven Byrnes
9mo
2
134
Inner Alignment in Salt-Starved Rats
Steven Byrnes
2y
39
144
Matt Botvinick on the spontaneous emergence of learning algorithms
Adam Scholl
2y
87
13
(Link) I'm Missing a Chunk of My Brain
mukashi
3mo
2
41
[Intro to brain-like-AGI safety] 2. “Learning from scratch” in the brain
Steven Byrnes
10mo
12