Go Back
Choose this branch
Choose this branch
meritocratic
regular
democratic
hot
top
alive
14 posts
Factored Cognition
Ought
14 posts
Debate (AI safety technique)
Adversarial Collaboration
61
Rant on Problem Factorization for Alignment
johnswentworth
4mo
48
114
Supervise Process, not Outcomes
stuhlmueller
8mo
8
49
Ought will host a factored cognition “Lab Meeting”
jungofthewon
3mo
1
49
Factored Cognition
stuhlmueller
4y
6
12
Traversing a Cognition Space
Rafael Harth
2y
5
31
Vaniver's View on Factored Cognition
Vaniver
3y
4
26
Idealized Factored Cognition
Rafael Harth
2y
6
63
A Library and Tutorial for Factored Cognition with Language Models
stuhlmueller
2mo
0
19
Alignment Newsletter #36
Rohin Shah
4y
0
10
[AN #86]: Improving debate and factored cognition through human experiments
Rohin Shah
2y
0
93
Ought: why it matters and ways to help
paulfchristiano
3y
7
17
Clarifying Factored Cognition
Rafael Harth
2y
2
25
Preface to the Sequence on Factored Cognition
Rafael Harth
2y
7
29
Update on Ought's experiments on factored evaluation of arguments
Owain_Evans
2y
0
25
Take 9: No, RLHF/IDA/debate doesn't solve outer alignment.
Charlie Steiner
8d
14
49
A Small Negative Result on Debate
Sam Bowman
8mo
11
44
AI Safety Debate and Its Applications
VojtaKovarik
3y
5
91
Writeup: Progress on AI Safety via Debate
Beth Barnes
2y
18
78
Imitative Generalisation (AKA 'Learning the Prior')
Beth Barnes
1y
14
66
Why I'm excited about Debate
Richard_Ngo
1y
12
29
New paper: (When) is Truth-telling Favored in AI debate?
VojtaKovarik
2y
7
8
Thoughts on "AI safety via debate"
Gordon Seidoh Worley
4y
4
27
Debate Minus Factored Cognition
abramdemski
1y
42
18
Problems with AI debate
Stuart_Armstrong
3y
3
37
How should AI debate be judged?
abramdemski
2y
27
70
A guide to Iterated Amplification & Debate
Rafael Harth
2y
10
39
Looking for adversarial collaborators to test our Debate protocol
Beth Barnes
2y
5
19
AI Safety via Debate
ESRogs
4y
13