31 posts
SERI MATS
AI Alignment Fieldbuilding
Intellectual Progress (Society-Level)
Distillation & Pedagogy
Practice & Philosophy of Science
Information Hazards
PIBBSS
Intellectual Progress via LessWrong
Economic Consequences of AGI
Privacy
Superintelligence
Automation
532 posts
Epistemology
Intellectual Progress (Individual-Level)
Research Taste
Epistemic Review
Selection Effects
Social & Cultural Dynamics
Humility
265
Lessons learned from talking to >100 academics about AI safety
Marius Hobbhahn
2mo
16
221
Call For Distillers
johnswentworth
8mo
42
167
Conjecture: Internal Infohazard Policy
Connor Leahy
4mo
6
158
Most People Start With The Same Few Bad Ideas
johnswentworth
3mo
30
144
Your posts should be on arXiv
JanBrauner
3mo
39
133
The Fusion Power Generator Scenario
johnswentworth
2y
29
100
Productive Mistakes, Not Perfect Answers
adamShimi
8mo
11
99
On Solving Problems Before They Appear: The Weird Epistemologies of Alignment
adamShimi
1y
11
97
Intuitions about solving hard problems
Richard_Ngo
7mo
23
90
Intermittent Distillations #4: Semiconductors, Economics, Intelligence, and Technological Progress.
Mark Xu
1y
9
86
ML Alignment Theory Program under Evan Hubinger
Oliver Zhang
1y
3
76
SERI MATS Program - Winter 2022 Cohort
Ryan Kidd
2mo
12
54
Principles of Privacy for Alignment Research
johnswentworth
4mo
30
53
[An email with a bunch of links I sent an experienced ML researcher interested in learning about Alignment / x-safety.]
David Scott Krueger (formerly: capybaralet)
3mo
1
305
Alignment Research Field Guide
abramdemski
3y
9
73
How to do theoretical research, a personal perspective
Mark Xu
4mo
4
65
How I Formed My Own Views About AI Safety
Neel Nanda
9mo
6
55
Methodological Therapy: An Agenda For Tackling Research Bottlenecks
adamShimi
2mo
6
43
Shapes of Mind and Pluralism in Alignment
adamShimi
4mo
1
38
Torture and Dust Specks and Joy--Oh my! or: Non-Archimedean Utility Functions as Pseudograded Vector Spaces
Louis_Brown
3y
29
32
AI Alignment Open Thread August 2019
habryka
3y
96
29
Attempts at Forwarding Speed Priors
james.lucassen
2mo
2
26
Forum Digest: Corrigibility, utility indifference, & related control ideas
Benya_Fallenstein
7y
0
25
What are concrete examples of potential "lock-in" in AI research?
Grue_Slinky
3y
6
24
Where's the Turing Machine? A step towards Ontology Identification
adamShimi
2y
0
23
VOI is Only Nonnegative When Information is Uncorrelated With Future Action
Diffractor
4y
2
23
Epistemic Strategies of Selection Theorems
adamShimi
1y
1
23
On the falsifiability of hypercomputation, part 2: finite input streams
jessicata
2y
7