7 posts
Treacherous Turn
4 posts
Tripwire
118
Soares, Tallinn, and Yudkowsky discuss AGI cognition
So8res
1y
35
23
[AN #165]: When large models are more likely to lie
Rohin Shah
1y
0
31
[Linkpost] Treacherous turns in the wild
Mark Xu
1y
6
73
A Gym Gridworld Environment for the Treacherous Turn
Michaël Trazzi
4y
9
17
Any work on honeypots (to detect treacherous turn attempts)?
David Scott Krueger (formerly: capybaralet)
2y
4
36
A toy model of the treacherous turn
Stuart_Armstrong
6y
13
16
Superintelligence 11: The treacherous turn
KatjaGrace
8y
50
14
Superintelligence 13: Capability control methods
KatjaGrace
8y
48
3
Corrigibility thoughts III: manipulating versus deceiving
Stuart_Armstrong
5y
0
3
Corrigibility thoughts II: the robot operator
Stuart_Armstrong
5y
2
2
Corrigibility thoughts I: caring about multiple things
Stuart_Armstrong
5y
0