42 posts
GPT
Bounties & Prizes (active)
AI-assisted Alignment
AI Safety Public Materials
List of Links
8 posts
Moore's Law
Compute
Nanotechnology
Computer Science
Tripwire
Quantum Mechanics
90
[Link] Why I’m optimistic about OpenAI’s alignment approach
janleike
15d
13
18
An exploration of GPT-2's embedding weights
Adam Scherlis
7d
2
12
Research request (alignment strategy): Deep dive on "making AI solve alignment for us"
JanBrauner
19d
3
12
[LINK] - ChatGPT discussion
JanBrauner
19d
7
8
Distribution Shifts and The Importance of AI Safety
Leon Lang
2mo
2
5
AI-assisted list of ten concrete alignment things to do right now
lcmgcd
3mo
5
68
NeurIPS ML Safety Workshop 2022
Dan H
4mo
2
22
[$20K in Prizes] AI Safety Arguments Competition
Dan H
7mo
543
125
Developmental Stages of GPTs
orthonormal
2y
74
0
New(ish) AI control ideas
Stuart_Armstrong
5y
0
96
Collection of GPT-3 results
Kaj_Sotala
2y
24
145
interpreting GPT: the logit lens
nostalgebraist
2y
32
164
MIRI comments on Cotra's "Case for Aligning Narrowly Superhuman Models"
Rob Bensinger
1y
13
105
Alignment As A Bottleneck To Usefulness Of GPT-3
johnswentworth
2y
57
3
Corrigibility thoughts I: caring about multiple things
Stuart_Armstrong
5y
0
20
Algorithmic Similarity
LukasM
3y
10
67
Compute Trends Across Three eras of Machine Learning
Jsevillamol
10mo
13
19
Implications of Quantum Computing for Artificial Intelligence Alignment Research
Jsevillamol
3y
3
120
Moore's Law, AI, and the pace of progress
Veedrac
1y
39
41
Reasons compute may not drive AI capabilities growth
Kythe
4y
10
44
Verification and Transparency
DanielFilan
3y
6
48
Predicting GPU performance
Marius Hobbhahn
6d
24