Go Back
Choose this branch
Choose this branch
meritocratic
regular
democratic
hot
top
alive
42 posts
GPT
Bounties & Prizes (active)
AI-assisted Alignment
AI Safety Public Materials
List of Links
8 posts
Moore's Law
Compute
Nanotechnology
Computer Science
Tripwire
Quantum Mechanics
96
[Link] Why I’m optimistic about OpenAI’s alignment approach
janleike
15d
13
34
An exploration of GPT-2's embedding weights
Adam Scherlis
7d
2
20
Research request (alignment strategy): Deep dive on "making AI solve alignment for us"
JanBrauner
19d
3
14
[LINK] - ChatGPT discussion
JanBrauner
19d
7
26
Distribution Shifts and The Importance of AI Safety
Leon Lang
2mo
2
11
AI-assisted list of ten concrete alignment things to do right now
lcmgcd
3mo
5
76
NeurIPS ML Safety Workshop 2022
Dan H
4mo
2
126
[$20K in Prizes] AI Safety Arguments Competition
Dan H
7mo
543
155
Developmental Stages of GPTs
orthonormal
2y
74
0
New(ish) AI control ideas
Stuart_Armstrong
5y
0
82
Collection of GPT-3 results
Kaj_Sotala
2y
24
171
interpreting GPT: the logit lens
nostalgebraist
2y
32
108
MIRI comments on Cotra's "Case for Aligning Narrowly Superhuman Models"
Rob Bensinger
1y
13
117
Alignment As A Bottleneck To Usefulness Of GPT-3
johnswentworth
2y
57
1
Corrigibility thoughts I: caring about multiple things
Stuart_Armstrong
5y
0
34
Algorithmic Similarity
LukasM
3y
10
115
Compute Trends Across Three eras of Machine Learning
Jsevillamol
10mo
13
29
Implications of Quantum Computing for Artificial Intelligence Alignment Research
Jsevillamol
3y
3
120
Moore's Law, AI, and the pace of progress
Veedrac
1y
39
43
Reasons compute may not drive AI capabilities growth
Kythe
4y
10
24
Verification and Transparency
DanielFilan
3y
6
70
Predicting GPU performance
Marius Hobbhahn
6d
24