Tree of Tags

Go Back

Choose this branch

Choose this branch

meritocratic regular democratic

hot top alive

76 posts Inner Alignment Outer Alignment Mesa-Optimization

78 posts Neuroscience Predictive Processing Neuromorphic AI Brain-Computer Interfaces Neocortex Neuralink Systems Thinking Emergent Behavior ( Emergence )

60 Paper: Constitutional AI: Harmlessness from AI Feedback (Anthropic)

LawrenceC

4d

10

96 Inner and outer alignment decompose one hard problem into two extremely hard problems

TurnTrout

18d

18

35 Mesa-Optimizers via Grokking

orthonormal

14d

4

26 Take 8: Queer the inner/outer alignment dichotomy.

Charlie Steiner

11d

2

87 Trying to Make a Treacherous Mesa-Optimizer

MadHatter

1mo

13

20 Value Formation: An Overarching Model

Thane Ruthenis

1mo

6

72 How likely is deceptive alignment?

evhub

3mo

21

13 The Disastrously Confident And Inaccurate AI

Sharat Jacob Jacob

1mo

0

21 Greed Is the Root of This Evil

Thane Ruthenis

2mo

4

36 Broad Picture of Human Values

Thane Ruthenis

4mo

5

43 Outer vs inner misalignment: three framings

Richard_Ngo

5mo

4

8 I there a demo of "You can't fetch the coffee if you're dead"?

Ram Rachum

1mo

9

103 Selection Theorems: A Program For Understanding Agents

johnswentworth

1y

23

175 Inner Alignment: Explain like I'm 12 Edition

Rafael Harth

2y

46

30 Predictive Processing, Heterosexuality and Delusions of Grandeur

lsusr

3d

2

61 My take on Jacob Cannell’s take on AGI safety

Steven Byrnes

22d

13

34 [Hebbian Natural Abstractions] Introduction

Samuel Nellessen

29d

3

29 Unpacking "Shard Theory" as Hunch, Question, Theory, and Insight

Jacy Reese Anthis

1mo

8

37 AI researchers announce NeuroAI agenda

Cameron Berg

1mo

12

31 Quick notes on “mirror neurons”

Steven Byrnes

2mo

2

31 On oxytocin-sensitive neurons in auditory cortex

Steven Byrnes

3mo

6

165 Predictive Coding has been Unified with Backpropagation

lsusr

1y

44

62 Theoretical Neuroscience For Alignment Theory

Cameron Berg

1y

19

136 Inner Alignment in Salt-Starved Rats

Steven Byrnes

2y

39

144 My computational framework for the brain

Steven Byrnes

2y

26

110 Book review: "A Thousand Brains" by Jeff Hawkins

Steven Byrnes

1y

18

41 [Intro to brain-like-AGI safety] 8. Takeaways from neuro 1/2: On AGI development

Steven Byrnes

9mo

2

147 Matt Botvinick on the spontaneous emergence of learning algorithms

Adam Scholl

2y

87