Tree of Tags

Go Back

You can't go any further

You can't go any further

meritocratic regular democratic

hot top alive

0 posts Goal Factoring

20 posts Research Agendas

33 My AGI safety research—2022 review, ’23 plans

Steven Byrnes

6d

6

285 On how various plans miss the hard bits of the alignment challenge

So8res

5mo

81

181 Some conceptual alignment research projects

Richard_Ngo

3mo

14

20 Distilled Representations Research Agenda

Hoagy

2mo

2

52 Resources for AI Alignment Cartography

Gyrodiot

2y

8

42 Technical AGI safety research outside AI

Richard_Ngo

3y

3

20 Why I am not currently working on the AAMLS agenda

jessicata

5y

1

75 The Learning-Theoretic AI Alignment Research Agenda

Vanessa Kosoy

4y

39

121 Thoughts on Human Models

Ramana Kumar

3y

32

52 Research Agenda v0.9: Synthesising a human's preferences into a utility function

Stuart_Armstrong

3y

25

39 Research agenda update

Steven Byrnes

1y

40

26 New safety research agenda: scalable agent alignment via reward modeling

Vika

4y

13

24 Research Agenda in reverse: what *would* a solution look like?

Stuart_Armstrong

3y

25

8 Acknowledgements & References

JesseClifton

3y

0