19 posts
AI Boxing (Containment)
AI Persuasion
79 posts
Oracle AI
Acausal Trade
Superrationality
Values handshakes
76
Risks from AI persuasion
Beth Barnes
12mo
15
87
Persuasion Tools: AI takeover without AGI or agency?
Daniel Kokotajlo
2y
24
14
Multiple AIs in boxes, evaluating each other's alignment
Moebius314
6mo
0
120
The Strangest Thing An AI Could Tell You
Eliezer Yudkowsky
13y
605
65
I attempted the AI Box Experiment (and lost)
Tuxedage
9y
245
58
I attempted the AI Box Experiment again! (And won - Twice!)
Tuxedage
9y
168
37
How To Win The AI Box Experiment (Sometimes)
pinkgothic
7y
21
42
I played the AI Box Experiment again! (and lost both games)
Tuxedage
9y
123
29
How to escape from your sandbox and from your hardware host
PhilGoetz
7y
28
14
Sandboxing by Physical Simulation?
moridinamael
4y
4
44
Cryptographic Boxes for Unfriendly AI
paulfchristiano
12y
162
10
Is there a simple parameter that controls human working memory capacity, which has been set tragically low?
Liron
3y
8
14
AI Alignment Prize: Super-Boxing
X4vier
4y
6
19
AI box: AI has one shot at avoiding destruction - what might it say?
ancientcampus
9y
355
18
Conditions for Superrationality-motivated Cooperation in a one-shot Prisoner's Dilemma
Jim Buhler
1d
2
10
Prosaic misalignment from the Solomonoff Predictor
Cleo Nardo
11d
0
39
The Solomonoff prior is malign. It's not a big deal.
Charlie Steiner
3mo
9
61
[Repost] Non-Nashian Game Theory: A Normal-Form Primer
Ghislain Fourny
6mo
14
110
A new acausal trading platform: RobinShould
Matthew Barnett
1y
5
35
Superrational Agents Kelly Bet Influence!
abramdemski
1y
5
51
Results of $1,000 Oracle contest!
Stuart_Armstrong
2y
2
1
How does acausal trade work in a deterministic multiverse?
sisyphus
1mo
13
107
[REPOST] The Demiurge’s Older Brother
Scott Alexander
5y
2
57
Contest: $1,000 for good questions to ask to an Oracle AI
Stuart_Armstrong
3y
156
54
Book Review: AI Safety and Security
Michaël Trazzi
4y
2
39
Counterfactual Oracles = online supervised learning with random selection of training episodes
Wei_Dai
3y
26
23
Breaking Oracles: superrationality and acausal trade
Stuart_Armstrong
3y
15
18
Analysing: Dangerous messages from future UFAI via Oracles
Stuart_Armstrong
3y
16