Go Back
Choose this branch
Choose this branch
meritocratic
regular
democratic
hot
top
alive
42 posts
GPT
Bounties & Prizes (active)
AI-assisted Alignment
AI Safety Public Materials
List of Links
8 posts
Moore's Law
Compute
Nanotechnology
Computer Science
Tripwire
Quantum Mechanics
93
[Link] Why I’m optimistic about OpenAI’s alignment approach
janleike
15d
13
26
An exploration of GPT-2's embedding weights
Adam Scherlis
7d
2
60
By Default, GPTs Think In Plain Sight
Fabien Roger
1mo
16
31
[ASoT] Finetuning, RL, and GPT's world prior
Jozdien
18d
8
7
Alignment with argument-networks and assessment-predictions
Tor Økland Barstad
7d
3
16
Research request (alignment strategy): Deep dive on "making AI solve alignment for us"
JanBrauner
19d
3
13
[LINK] - ChatGPT discussion
JanBrauner
19d
7
92
Beliefs and Disagreements about Automating Alignment Research
Ian McKenzie
3mo
4
223
New Scaling Laws for Large Language Models
1a3orn
8mo
21
36
Prizes for ML Safety Benchmark Ideas
joshc
1mo
3
151
Godzilla Strategies
johnswentworth
6mo
65
68
$20K In Bounties for AI Safety Public Materials
Dan H
4mo
7
72
NeurIPS ML Safety Workshop 2022
Dan H
4mo
2
33
Recall and Regurgitation in GPT2
Megan Kinniment
2mo
1
59
Predicting GPU performance
Marius Hobbhahn
6d
24
120
Moore's Law, AI, and the pace of progress
Veedrac
1y
39
91
Compute Trends Across Three eras of Machine Learning
Jsevillamol
10mo
13
42
Reasons compute may not drive AI capabilities growth
Kythe
4y
10
34
Verification and Transparency
DanielFilan
3y
6
27
Algorithmic Similarity
LukasM
3y
10
24
Implications of Quantum Computing for Artificial Intelligence Alignment Research
Jsevillamol
3y
3
2
Corrigibility thoughts I: caring about multiple things
Stuart_Armstrong
5y
0