Tree of Tags

Go Back

Choose this branch

Choose this branch

meritocratic regular democratic

hot top alive

31 posts SERI MATS AI Alignment Fieldbuilding Intellectual Progress (Society-Level) Distillation & Pedagogy Practice & Philosophy of Science Information Hazards PIBBSS Intellectual Progress via LessWrong Economic Consequences of AGI Privacy Superintelligence Automation

532 posts Epistemology Intellectual Progress (Individual-Level) Research Taste Epistemic Review Selection Effects Social & Cultural Dynamics Humility

265 Lessons learned from talking to >100 academics about AI safety

Marius Hobbhahn

2mo

16

221 Call For Distillers

johnswentworth

8mo

42

167 Conjecture: Internal Infohazard Policy

Connor Leahy

4mo

6

158 Most People Start With The Same Few Bad Ideas

johnswentworth

3mo

30

144 Your posts should be on arXiv

JanBrauner

3mo

39

133 The Fusion Power Generator Scenario

johnswentworth

2y

29

100 Productive Mistakes, Not Perfect Answers

adamShimi

8mo

11

99 On Solving Problems Before They Appear: The Weird Epistemologies of Alignment

adamShimi

1y

11

97 Intuitions about solving hard problems

Richard_Ngo

7mo

23

90 Intermittent Distillations #4: Semiconductors, Economics, Intelligence, and Technological Progress.

Mark Xu

1y

9

86 ML Alignment Theory Program under Evan Hubinger

Oliver Zhang

1y

3

76 SERI MATS Program - Winter 2022 Cohort

Ryan Kidd

2mo

12

54 Principles of Privacy for Alignment Research

johnswentworth

4mo

30

53 [An email with a bunch of links I sent an experienced ML researcher interested in learning about Alignment / x-safety.]

David Scott Krueger (formerly: capybaralet)

3mo

1

305 Alignment Research Field Guide

abramdemski

3y

9

73 How to do theoretical research, a personal perspective

Mark Xu

4mo

4

65 How I Formed My Own Views About AI Safety

Neel Nanda

9mo

6

55 Methodological Therapy: An Agenda For Tackling Research Bottlenecks

adamShimi

2mo

6

43 Shapes of Mind and Pluralism in Alignment

adamShimi

4mo

1

38 Torture and Dust Specks and Joy--Oh my! or: Non-Archimedean Utility Functions as Pseudograded Vector Spaces

Louis_Brown

3y

29

32 AI Alignment Open Thread August 2019

habryka

3y

96

29 Attempts at Forwarding Speed Priors

james.lucassen

2mo

2

26 Forum Digest: Corrigibility, utility indifference, & related control ideas

Benya_Fallenstein

7y

0

25 What are concrete examples of potential "lock-in" in AI research?

Grue_Slinky

3y

6

24 Where's the Turing Machine? A step towards Ontology Identification

adamShimi

2y

0

23 VOI is Only Nonnegative When Information is Uncorrelated With Future Action

Diffractor

4y

2

23 Epistemic Strategies of Selection Theorems

adamShimi

1y

1

23 On the falsifiability of hypercomputation, part 2: finite input streams

jessicata

2y

7