Go Back
You can't go any further
You can't go any further
meritocratic
regular
democratic
hot
top
alive
0 posts
Adversarial Collaboration
14 posts
Debate (AI safety technique)
106
Imitative Generalisation (AKA 'Learning the Prior')
Beth Barnes
1y
14
97
Writeup: Progress on AI Safety via Debate
Beth Barnes
2y
18
80
Why I'm excited about Debate
Richard_Ngo
1y
12
66
A guide to Iterated Amplification & Debate
Rafael Harth
2y
10
65
Looking for adversarial collaborators to test our Debate protocol
Beth Barnes
2y
5
61
How should AI debate be judged?
abramdemski
2y
27
47
Take 9: No, RLHF/IDA/debate doesn't solve outer alignment.
Charlie Steiner
8d
14
47
Debate Minus Factored Cognition
abramdemski
1y
42
35
A Small Negative Result on Debate
Sam Bowman
8mo
11
35
New paper: (When) is Truth-telling Favored in AI debate?
VojtaKovarik
2y
7
35
AI Safety via Debate
ESRogs
4y
13
28
AI Safety Debate and Its Applications
VojtaKovarik
3y
5
24
Problems with AI debate
Stuart_Armstrong
3y
3
16
Thoughts on "AI safety via debate"
Gordon Seidoh Worley
4y
4