Tree of Tags

48 posts: Community, Center for Human-Compatible AI (CHAI), Moral Uncertainty, Regulation and AI Risk, Grants & Fundraising Opportunities, Future of Humanity Institute (FHI), Population Ethics, Utilitarianism, The SF Bay Area, Future of Life Institute (FLI), Disagreement, Events (Community)

11 posts: Agent Foundations, Machine Intelligence Research Institute (MIRI), Cognitive Reduction, Dissolving the Question

13 Event [Berkeley]: Alignment Collaborator Speed-Meeting

AlexMennen

1d

2

13 Looking for an alignment tutor

JanBrauner

3d

2

87 Announcing AI Alignment Awards: $100k research contests about goal misgeneralization & corrigibility

Akash

28d

20

35 A newcomer’s guide to the technical AI safety field

zeshen

1mo

1

91 AI Safety and Neighboring Communities: A Quick-Start Guide, as of Summer 2022

Sam Bowman

3mo

2

99 Announcing the Introduction to ML Safety course

Dan H

4mo

6

20 The Slippery Slope from DALLE-2 to Deepfake Anarchy

scasper

1mo

9

63 Seeking Interns/RAs for Mechanistic Interpretability Projects

Neel Nanda

4mo

0

54 Encultured AI Pre-planning, Part 1: Enabling New Benchmarks

Andrew_Critch

4mo

2

94 Introducing the ML Safety Scholars Program

Dan H

7mo

2

24 CHAI, Assistance Games, And Fully-Updated Deference [Scott Alexander]

berglund

2mo

1

31 *New* Canada AI Safety & Governance community

Wyatt Tessari L'Allié

3mo

0

58 Jobs: Help scale up LM alignment research at NYU

Sam Bowman

7mo

1

85 Apply to the ML for Alignment Bootcamp (MLAB) in Berkeley [Jan 3 - Jan 22]

habryka

1y

4

297 Why Agent Foundations? An Overly Abstract Explanation

johnswentworth

9mo

54

85 Prize and fast track to alignment research at ALTER

Vanessa Kosoy

3mo

4

61 Clarifying the Agent-Like Structure Problem

johnswentworth

2mo

14

83 Challenges with Breaking into MIRI-Style Research

Chris_Leong

11mo

15

23 Bridging Expected Utility Maximization and Optimization

Whispermute

4mo

5

250 The Rocket Alignment Problem

Eliezer Yudkowsky

4y

42

37 Grokking the Intentional Stance

jbkjr

1y

20

100 What I’ll be doing at MIRI

evhub

3y

6

74 Another take on agent foundations: formalizing zero-shot reasoning

zhukeepa

4y

20

24 On motivations for MIRI's highly reliable agent design research

jessicata

5y

1

15 My current take on the Paul-MIRI disagreement on alignability of messy AI

jessicata

5y

0