Go Back
You can't go any further
You can't go any further
meritocratic
regular
democratic
hot
top
alive
7 posts
Treacherous Turn
4 posts
Tripwire
134
Soares, Tallinn, and Yudkowsky discuss AGI cognition
So8res
1y
35
20
Superintelligence 11: The treacherous turn
KatjaGrace
8y
50
39
[Linkpost] Treacherous turns in the wild
Mark Xu
1y
6
26
[AN #165]: When large models are more likely to lie
Rohin Shah
1y
0
66
A Gym Gridworld Environment for the Treacherous Turn
Michaël Trazzi
4y
9
18
Any work on honeypots (to detect treacherous turn attempts)?
David Scott Krueger (formerly: capybaralet)
2y
4
39
A toy model of the treacherous turn
Stuart_Armstrong
6y
13
19
Superintelligence 13: Capability control methods
KatjaGrace
8y
48
3
Corrigibility thoughts I: caring about multiple things
Stuart_Armstrong
5y
0
4
Corrigibility thoughts II: the robot operator
Stuart_Armstrong
5y
2
4
Corrigibility thoughts III: manipulating versus deceiving
Stuart_Armstrong
5y
0