Go Back
Choose this branch
Choose this branch
meritocratic
regular
democratic
hot
top
alive
42 posts
GPT
Bounties & Prizes (active)
AI-assisted Alignment
AI Safety Public Materials
List of Links
8 posts
Moore's Law
Compute
Nanotechnology
Computer Science
Tripwire
Quantum Mechanics
90
[Link] Why I’m optimistic about OpenAI’s alignment approach
janleike
15d
13
18
An exploration of GPT-2's embedding weights
Adam Scherlis
7d
2
45
By Default, GPTs Think In Plain Sight
Fabien Roger
1mo
16
12
[LINK] - ChatGPT discussion
JanBrauner
19d
7
12
Research request (alignment strategy): Deep dive on "making AI solve alignment for us"
JanBrauner
19d
3
36
Prizes for ML Safety Benchmark Ideas
joshc
1mo
3
4
Alignment with argument-networks and assessment-predictions
Tor Økland Barstad
7d
3
85
Beliefs and Disagreements about Automating Alignment Research
Ian McKenzie
3mo
4
191
New Scaling Laws for Large Language Models
1a3orn
8mo
21
127
Godzilla Strategies
johnswentworth
6mo
65
68
NeurIPS ML Safety Workshop 2022
Dan H
4mo
2
52
$20K In Bounties for AI Safety Public Materials
Dan H
4mo
7
23
Recall and Regurgitation in GPT2
Megan Kinniment
2mo
1
204
The case for aligning narrowly superhuman models
Ajeya Cotra
1y
74
48
Predicting GPU performance
Marius Hobbhahn
6d
24
120
Moore's Law, AI, and the pace of progress
Veedrac
1y
39
67
Compute Trends Across Three eras of Machine Learning
Jsevillamol
10mo
13
44
Verification and Transparency
DanielFilan
3y
6
41
Reasons compute may not drive AI capabilities growth
Kythe
4y
10
20
Algorithmic Similarity
LukasM
3y
10
19
Implications of Quantum Computing for Artificial Intelligence Alignment Research
Jsevillamol
3y
3
3
Corrigibility thoughts I: caring about multiple things
Stuart_Armstrong
5y
0