Tree of Tags

76 posts: Inner Alignment, Outer Alignment, Mesa-Optimization

78 posts: Neuroscience, Predictive Processing, Neuromorphic AI, Brain-Computer Interfaces, Neocortex, Neuralink, Systems Thinking, Emergent Behavior (Emergence)

61 Paper: Constitutional AI: Harmlessness from AI Feedback (Anthropic)

LawrenceC

4d

10

84 Inner and outer alignment decompose one hard problem into two extremely hard problems

TurnTrout

18d

18

41 Mesa-Optimizers via Grokking

orthonormal

14d

4

28 Take 8: Queer the inner/outer alignment dichotomy.

Charlie Steiner

11d

2

81 Trying to Make a Treacherous Mesa-Optimizer

MadHatter

1mo

13

19 Value Formation: An Overarching Model

Thane Ruthenis

1mo

6

60 How likely is deceptive alignment?

evhub

3mo

21

22 Greed Is the Root of This Evil

Thane Ruthenis

2mo

4

36 Broad Picture of Human Values

Thane Ruthenis

4mo

5

42 Outer vs inner misalignment: three framings

Richard_Ngo

5mo

4

94 Selection Theorems: A Program For Understanding Agents

johnswentworth

1y

23

165 Inner Alignment: Explain like I'm 12 Edition

Rafael Harth

2y

46

38 [Intro to brain-like-AGI safety] 10. The alignment problem

Steven Byrnes

8mo

4

32 The Speed + Simplicity Prior is probably anti-deceptive

7mo

29

32 Predictive Processing, Heterosexuality and Delusions of Grandeur

lsusr

3d

2

72 My take on Jacob Cannell’s take on AGI safety

Steven Byrnes

22d

13

29 [Hebbian Natural Abstractions] Introduction

Samuel Nellessen

29d

3

25 Unpacking "Shard Theory" as Hunch, Question, Theory, and Insight

Jacy Reese Anthis

1mo

8

31 AI researchers announce NeuroAI agenda

Cameron Berg

1mo

12

34 Quick notes on “mirror neurons”

Steven Byrnes

2mo

2

40 On oxytocin-sensitive neurons in auditory cortex

Steven Byrnes

3mo

6

128 Predictive Coding has been Unified with Backpropagation

lsusr

1y

44

119 Book review: "A Thousand Brains" by Jeff Hawkins

Steven Byrnes

1y

18

44 [Intro to brain-like-AGI safety] 8. Takeaways from neuro 1/2: On AGI development

Steven Byrnes

9mo

2

134 Inner Alignment in Salt-Starved Rats

Steven Byrnes

2y

39

144 Matt Botvinick on the spontaneous emergence of learning algorithms

Adam Scholl

2y

87

13 (Link) I'm Missing a Chunk of My Brain

mukashi

3mo

2

41 [Intro to brain-like-AGI safety] 2. “Learning from scratch” in the brain

Steven Byrnes

10mo

12