Go Back
Choose this branch
Choose this branch
meritocratic
regular
democratic
hot
top
alive
14 posts
Factored Cognition
Ought
14 posts
Debate (AI safety technique)
Adversarial Collaboration
85
Rant on Problem Factorization for Alignment
johnswentworth
4mo
48
122
Supervise Process, not Outcomes
stuhlmueller
8mo
8
31
A Library and Tutorial for Factored Cognition with Language Models
stuhlmueller
2mo
0
21
Ought will host a factored cognition “Lab Meeting”
jungofthewon
3mo
1
81
Ought: why it matters and ways to help
paulfchristiano
3y
7
45
Preface to the Sequence on Factored Cognition
Rafael Harth
2y
7
42
Idealized Factored Cognition
Rafael Harth
2y
6
65
Vaniver's View on Factored Cognition
Vaniver
3y
4
29
Clarifying Factored Cognition
Rafael Harth
2y
2
20
Traversing a Cognition Space
Rafael Harth
2y
5
29
Update on Ought's experiments on factored evaluation of arguments
Owain_Evans
2y
0
41
Factored Cognition
stuhlmueller
4y
6
18
[AN #86]: Improving debate and factored cognition through human experiments
Rohin Shah
2y
0
23
Alignment Newsletter #36
Rohin Shah
4y
0
47
Take 9: No, RLHF/IDA/debate doesn't solve outer alignment.
Charlie Steiner
8d
14
35
A Small Negative Result on Debate
Sam Bowman
8mo
11
106
Imitative Generalisation (AKA 'Learning the Prior')
Beth Barnes
1y
14
80
Why I'm excited about Debate
Richard_Ngo
1y
12
97
Writeup: Progress on AI Safety via Debate
Beth Barnes
2y
18
66
A guide to Iterated Amplification & Debate
Rafael Harth
2y
10
65
Looking for adversarial collaborators to test our Debate protocol
Beth Barnes
2y
5
61
How should AI debate be judged?
abramdemski
2y
27
47
Debate Minus Factored Cognition
abramdemski
1y
42
35
New paper: (When) is Truth-telling Favored in AI debate?
VojtaKovarik
2y
7
28
AI Safety Debate and Its Applications
VojtaKovarik
3y
5
24
Problems with AI debate
Stuart_Armstrong
3y
3
35
AI Safety via Debate
ESRogs
4y
13
16
Thoughts on "AI safety via debate"
Gordon Seidoh Worley
4y
4