Community (48 posts)
Related tags: Center for Human-Compatible AI (CHAI), Moral Uncertainty, Regulation and AI Risk, Grants & Fundraising Opportunities, Future of Humanity Institute (FHI), Population Ethics, Utilitarianism, The SF Bay Area, Future of Life Institute (FLI), Disagreement, Events (Community)
Agent Foundations (11 posts)
Related tags: Machine Intelligence Research Institute (MIRI), Cognitive Reduction, Dissolving the Question
Community posts

Karma | Title | Author | Age | Comments
23 | Event [Berkeley]: Alignment Collaborator Speed-Meeting | AlexMennen | 1d | 2
17 | Looking for an alignment tutor | JanBrauner | 3d | 2
51 | Announcing AI Alignment Awards: $100k research contests about goal misgeneralization & corrigibility | Akash | 28d | 20
25 | A newcomer’s guide to the technical AI safety field | zeshen | 1mo | 1
57 | AI Safety and Neighboring Communities: A Quick-Start Guide, as of Summer 2022 | Sam Bowman | 3mo | 2
70 | Encultured AI Pre-planning, Part 1: Enabling New Benchmarks | Andrew_Critch | 4mo | 2
59 | Seeking Interns/RAs for Mechanistic Interpretability Projects | Neel Nanda | 4mo | 0
12 | The Slippery Slope from DALLE-2 to Deepfake Anarchy | scasper | 1mo | 9
39 | Announcing the Introduction to ML Safety course | Dan H | 4mo | 6
62 | Jobs: Help scale up LM alignment research at NYU | Sam Bowman | 7mo | 1
18 | CHAI, Assistance Games, And Fully-Updated Deference [Scott Alexander] | berglund | 2mo | 1
105 | Apply to the ML for Alignment Bootcamp (MLAB) in Berkeley [Jan 3 - Jan 22] | habryka | 1y | 4
52 | Introducing the ML Safety Scholars Program | Dan H | 7mo | 2
116 | Call for research on evaluating alignment (funding + advice available) | Beth Barnes | 1y | 11
Agent Foundations posts

Karma | Title | Author | Age | Comments
197 | Why Agent Foundations? An Overly Abstract Explanation | johnswentworth | 9mo | 54
45 | Clarifying the Agent-Like Structure Problem | johnswentworth | 2mo | 14
45 | Prize and fast track to alignment research at ALTER | Vanessa Kosoy | 3mo | 4
23 | Bridging Expected Utility Maximization and Optimization | Whispermute | 4mo | 5
61 | Challenges with Breaking into MIRI-Style Research | Chris_Leong | 11mo | 15
45 | Grokking the Intentional Stance | jbkjr | 1y | 20
120 | What I’ll be doing at MIRI | evhub | 3y | 6
146 | The Rocket Alignment Problem | Eliezer Yudkowsky | 4y | 42
44 | Another take on agent foundations: formalizing zero-shot reasoning | zhukeepa | 4y | 20
30 | On motivations for MIRI's highly reliable agent design research | jessicata | 5y | 1
27 | My current take on the Paul-MIRI disagreement on alignability of messy AI | jessicata | 5y | 0