Tree of Tags

0 posts Adversarial Collaboration

14 posts Debate (AI safety technique)

36 Take 9: No, RLHF/IDA/debate doesn't solve outer alignment.

Charlie Steiner

8d

14

42 A Small Negative Result on Debate

Sam Bowman

8mo

11

36 AI Safety Debate and Its Applications

VojtaKovarik

3y

5

94 Writeup: Progress on AI Safety via Debate

Beth Barnes

2y

18

92 Imitative Generalisation (AKA 'Learning the Prior')

Beth Barnes

1y

14

73 Why I'm excited about Debate

Richard_Ngo

1y

12

32 New paper: (When) is Truth-telling Favored in AI debate?

VojtaKovarik

2y

7

12 Thoughts on "AI safety via debate"

Gordon Seidoh Worley

4y

4

37 Debate Minus Factored Cognition

abramdemski

1y

42

21 Problems with AI debate

Stuart_Armstrong

3y

3

49 How should AI debate be judged?

abramdemski

2y

27

68 A guide to Iterated Amplification & Debate

Rafael Harth

2y

10

52 Looking for adversarial collaborators to test our Debate protocol

Beth Barnes

2y

5

27 AI Safety via Debate

ESRogs

4y

13