Tree of Tags

0 posts Goal Factoring

20 posts Research Agendas

Score | Title | Author | Age | Comments
34 | My AGI safety research—2022 review, ’23 plans | Steven Byrnes | 6d | 6
258 | On how various plans miss the hard bits of the alignment challenge | So8res | 5mo | 81
168 | Some conceptual alignment research projects | Richard_Ngo | 3mo | 14
15 | Distilled Representations Research Agenda | Hoagy | 2mo | 2
45 | Resources for AI Alignment Cartography | Gyrodiot | 2y | 8
43 | Technical AGI safety research outside AI | Richard_Ngo | 3y | 3
28 | Why I am not currently working on the AAMLS agenda | jessicata | 5y | 1
76 | The Learning-Theoretic AI Alignment Research Agenda | Vanessa Kosoy | 4y | 39
124 | Thoughts on Human Models | Ramana Kumar | 3y | 32
67 | Research Agenda v0.9: Synthesising a human's preferences into a utility function | Stuart_Armstrong | 3y | 25
54 | Research agenda update | Steven Byrnes | 1y | 40
34 | New safety research agenda: scalable agent alignment via reward modeling | Vika | 4y | 13
34 | Research Agenda in reverse: what *would* a solution look like? | Stuart_Armstrong | 3y | 25
6 | Acknowledgements & References | JesseClifton | 3y | 0