Tree of Tags

1014 posts: AI, AI Timelines, Value Learning, AI Takeoff, Embedded Agency, Community, Eliciting Latent Knowledge (ELK), Reinforcement Learning, Infra-Bayesianism, Counterfactuals, Logic & Mathematics, Interviews

111 posts: Iterated Amplification, Game Theory, Factored Cognition, Humans Consulting HCH, Research Agendas, Ought, Debate (AI safety technique), Risks of Astronomical Suffering (S-risks), Center on Long-Term Risk (CLR), Mechanism Design, Fairness, Group Rationality

503 (My understanding of) What Everyone in Technical Alignment is Doing and Why

Thomas Larsen

3mo

83

486 What 2026 looks like

Daniel Kokotajlo

1y

98

409 Discussion with Eliezer Yudkowsky on AGI interventions

Rob Bensinger

1y

257

334 EfficientZero: How It Works

1a3orn

1y

42

315 Two-year update on my personal AI timelines

Ajeya Cotra

4mo

60

297 Why Agent Foundations? An Overly Abstract Explanation

johnswentworth

9mo

54

296 Are we in an AI overhang?

Andy Jones

2y

109

276 Fun with +12 OOMs of Compute

Daniel Kokotajlo

1y

78

271 Reward is not the optimization target

TurnTrout

4mo

97

265 The Plan

johnswentworth

1y

77

265 Ngo and Yudkowsky on alignment difficulty

Eliezer Yudkowsky

1y

143

265 An overview of 11 proposals for building safe advanced AI

evhub

2y

36

263 DeepMind: Generally capable agents emerge from open-ended play

Daniel Kokotajlo

1y

53

258 Draft report on AI timelines

Ajeya Cotra

2y

56

285 On how various plans miss the hard bits of the alignment challenge

So8res

5mo

81

181 Some conceptual alignment research projects

Richard_Ngo

3mo

14

174 Unifying Bargaining Notions (1/2)

Diffractor

4mo

38

143 The Commitment Races problem

Daniel Kokotajlo

3y

39

132 Paul's research agenda FAQ

zhukeepa

4y

73

125 «Boundaries», Part 1: a key missing concept from utility theory

Andrew_Critch

4mo

26

121 Thoughts on Human Models

Ramana Kumar

3y

32

114 My Understanding of Paul Christiano's Iterated Amplification AI Safety Research Agenda

Chi Nguyen

2y

21

114 Supervise Process, not Outcomes

stuhlmueller

8mo

8

111 Debate update: Obfuscated arguments problem

Beth Barnes

1y

21

102 Our take on CHAI’s research agenda in under 1500 words

Alex Flint

2y

19

93 Ought: why it matters and ways to help

paulfchristiano

3y

7

91 Writeup: Progress on AI Safety via Debate

Beth Barnes

2y

18

84 Unifying Bargaining Notions (2/2)

Diffractor

4mo

11