Go Back
You can't go any further
You can't go any further
meritocratic
regular
democratic
hot
top
alive
7 posts
Treacherous Turn
4 posts
Tripwire
102
Soares, Tallinn, and Yudkowsky discuss AGI cognition
So8res
1y
35
12
Superintelligence 11: The treacherous turn
KatjaGrace
8y
50
23
[Linkpost] Treacherous turns in the wild
Mark Xu
1y
6
20
[AN #165]: When large models are more likely to lie
Rohin Shah
1y
0
80
A Gym Gridworld Environment for the Treacherous Turn
Michaël Trazzi
4y
9
16
Any work on honeypots (to detect treacherous turn attempts)?
David Scott Krueger (formerly: capybaralet)
2y
4
33
A toy model of the treacherous turn
Stuart_Armstrong
6y
13
9
Superintelligence 13: Capability control methods
KatjaGrace
8y
48
1
Corrigibility thoughts I: caring about multiple things
Stuart_Armstrong
5y
0
2
Corrigibility thoughts II: the robot operator
Stuart_Armstrong
5y
2
2
Corrigibility thoughts III: manipulating versus deceiving
Stuart_Armstrong
5y
0