Tree of Tags

Go Back

You can't go any further

You can't go any further

meritocratic regular democratic

hot top alive

0 posts Adversarial Collaboration

14 posts Debate (AI safety technique)

106 Imitative Generalisation (AKA 'Learning the Prior')

Beth Barnes

1y

14

97 Writeup: Progress on AI Safety via Debate

Beth Barnes

2y

18

80 Why I'm excited about Debate

Richard_Ngo

1y

12

66 A guide to Iterated Amplification & Debate

Rafael Harth

2y

10

65 Looking for adversarial collaborators to test our Debate protocol

Beth Barnes

2y

5

61 How should AI debate be judged?

abramdemski

2y

27

47 Take 9: No, RLHF/IDA/debate doesn't solve outer alignment.

Charlie Steiner

8d

14

47 Debate Minus Factored Cognition

abramdemski

1y

42

35 A Small Negative Result on Debate

Sam Bowman

8mo

11

35 New paper: (When) is Truth-telling Favored in AI debate?

VojtaKovarik

2y

7

35 AI Safety via Debate

ESRogs

4y

13

28 AI Safety Debate and Its Applications

VojtaKovarik

3y

5

24 Problems with AI debate

Stuart_Armstrong

3y

3

16 Thoughts on "AI safety via debate"

Gordon Seidoh Worley

4y

4