Go Back
Choose this branch
Choose this branch
meritocratic
regular
democratic
hot
top
alive
42 posts
GPT
Bounties & Prizes (active)
AI-assisted Alignment
AI Safety Public Materials
List of Links
8 posts
Moore's Law
Compute
Nanotechnology
Computer Science
Tripwire
Quantum Mechanics
93
[Link] Why I’m optimistic about OpenAI’s alignment approach
janleike
15d
13
26
An exploration of GPT-2's embedding weights
Adam Scherlis
7d
2
16
Research request (alignment strategy): Deep dive on "making AI solve alignment for us"
JanBrauner
19d
3
13
[LINK] - ChatGPT discussion
JanBrauner
19d
7
17
Distribution Shifts and The Importance of AI Safety
Leon Lang
2mo
2
8
AI-assisted list of ten concrete alignment things to do right now
lcmgcd
3mo
5
72
NeurIPS ML Safety Workshop 2022
Dan H
4mo
2
74
[$20K in Prizes] AI Safety Arguments Competition
Dan H
7mo
543
140
Developmental Stages of GPTs
orthonormal
2y
74
0
New(ish) AI control ideas
Stuart_Armstrong
5y
0
89
Collection of GPT-3 results
Kaj_Sotala
2y
24
158
interpreting GPT: the logit lens
nostalgebraist
2y
32
136
MIRI comments on Cotra's "Case for Aligning Narrowly Superhuman Models"
Rob Bensinger
1y
13
111
Alignment As A Bottleneck To Usefulness Of GPT-3
johnswentworth
2y
57
2
Corrigibility thoughts I: caring about multiple things
Stuart_Armstrong
5y
0
27
Algorithmic Similarity
LukasM
3y
10
91
Compute Trends Across Three eras of Machine Learning
Jsevillamol
10mo
13
24
Implications of Quantum Computing for Artificial Intelligence Alignment Research
Jsevillamol
3y
3
120
Moore's Law, AI, and the pace of progress
Veedrac
1y
39
42
Reasons compute may not drive AI capabilities growth
Kythe
4y
10
34
Verification and Transparency
DanielFilan
3y
6
59
Predicting GPU performance
Marius Hobbhahn
6d
24