Go Back
You can't go any further
You can't go any further
meritocratic
regular
democratic
hot
top
alive
7 posts
Treacherous Turn
4 posts
Tripwire
134
Soares, Tallinn, and Yudkowsky discuss AGI cognition
So8res
1y
35
66
A Gym Gridworld Environment for the Treacherous Turn
Michaël Trazzi
4y
9
39
[Linkpost] Treacherous turns in the wild
Mark Xu
1y
6
39
A toy model of the treacherous turn
Stuart_Armstrong
6y
13
26
[AN #165]: When large models are more likely to lie
Rohin Shah
1y
0
20
Superintelligence 11: The treacherous turn
KatjaGrace
8y
50
18
Any work on honeypots (to detect treacherous turn attempts)?
David Scott Krueger (formerly: capybaralet)
2y
4
19
Superintelligence 13: Capability control methods
KatjaGrace
8y
48
4
Corrigibility thoughts II: the robot operator
Stuart_Armstrong
5y
2
4
Corrigibility thoughts III: manipulating versus deceiving
Stuart_Armstrong
5y
0
3
Corrigibility thoughts I: caring about multiple things
Stuart_Armstrong
5y
0