Tree of Tags

22 posts: Outer Alignment, Mesa-Optimization

20 posts: Neuroscience, Neuromorphic AI, Predictive Processing, Neocortex, Computing Overhang, Planning & Decision-Making, Hansonian Pre-Rationality, Intentionality, Emergent Behavior (Emergence)

60 Paper: Constitutional AI: Harmlessness from AI Feedback (Anthropic)

LawrenceC

4d

10

68 Human Mimicry Mainly Works When We’re Already Close

johnswentworth

4mo

16

7 Inner alignment: what are we pointing at?

lcmgcd

3mo

2

24 Outer alignment and imitative amplification

evhub

2y

11

78 Risks from Learned Optimization: Conclusion and Related Work

evhub

3y

4

43 The Steering Problem

paulfchristiano

4y

12

47 Formal Solution to the Inner Alignment Problem

michaelcohen

1y

123

62 An Increasingly Manipulative Newsfeed

Michaël Trazzi

3y

16

166 Risks from Learned Optimization: Introduction

evhub

3y

42

61 "Inner Alignment Failures" Which Are Actually Outer Alignment Failures

johnswentworth

2y

38

22 If I were a well-intentioned AI... III: Extremal Goodhart

Stuart_Armstrong

2y

0

54 Mesa-Search vs Mesa-Control

abramdemski

2y

45

54 [AN #58] Mesa optimization: what it is, and why we should care

Rohin Shah

3y

9

20 If I were a well-intentioned AI... II: Acting in a world

Stuart_Armstrong

2y

0

61 My take on Jacob Cannell’s take on AGI safety

Steven Byrnes

22d

13

60 Multi-agent predictive minds and AI alignment

Jan_Kulveit

4y

18

18 Gary Marcus vs Cortical Uniformity

Steven Byrnes

2y

0

55 What Decision Theory is Implied By Predictive Processing?

johnswentworth

2y

17

20 Towards an Intentional Research Agenda

romeostevensit

3y

8

15 Minimization of prediction error as a foundation for human values in AI alignment

Gordon Seidoh Worley

3y

42

76 Inner alignment in the brain

Steven Byrnes

2y

16

58 Human instincts, symbol grounding, and the blank-slate neocortex

Steven Byrnes

3y

23

51 Building brain-inspired AGI is infinitely easier than understanding the brain

Steven Byrnes

2y

14

64 Brain-inspired AGI and the "lifetime anchor"

Steven Byrnes

1y

16

78 How uniform is the neocortex?

zhukeepa

2y

23

147 Matt Botvinick on the spontaneous emergence of learning algorithms

Adam Scholl

2y

87

53 Predictive coding = RL + SL + Bayes + MPC

Steven Byrnes

3y

8

144 My computational framework for the brain

Steven Byrnes

2y

26