32 posts
Machine Learning (ML)
OpenAI
Lottery Ticket Hypothesis
19 posts
DeepMind
Truth, Semantics, & Meaning
Anthropic
Honesty
Map and Territory
Calibration
226
Common misconceptions about OpenAI
Jacob_Hilton
3mo
138
146
the scaling “inconsistency”: openAI’s new insight
nostalgebraist
2y
14
135
Understanding “Deep Double Descent”
evhub
3y
51
125
A Bird's Eye View of the ML Field [Pragmatic AI Safety #2]
Dan H
7mo
5
89
Survey of NLP Researchers: NLP is contributing to AGI progress; major catastrophe plausible
Sam Bowman
3mo
6
89
Safety Implications of LeCun's path to machine intelligence
Ivan Vendrov
5mo
16
80
Gradations of Inner Alignment Obstacles
abramdemski
1y
22
67
Using GPT-N to Solve Interpretability of Neural Networks: A Research Agenda
Logan Riggs
2y
12
63
Inductive biases stick around
evhub
3y
14
60
SGD's Bias
johnswentworth
1y
16
57
Unsolved ML Safety Problems
jsteinhardt
1y
2
57
Multimodal Neurons in Artificial Neural Networks
Kaj_Sotala
1y
2
54
Tabooing 'Agent' for Prosaic Alignment
Hjalmar_Wijk
3y
10
52
A Data limited future
Donald Hobson
4mo
25
364
DeepMind alignment team opinions on AGI ruin arguments
Vika
4mo
34
265
A challenge for AGI organizations, and a challenge for readers
Rob Bensinger
19d
30
104
Caution when interpreting Deepmind's In-context RL paper
Sam Marks
1mo
6
102
Clarifying AI X-risk
zac_kenton
1mo
23
96
Paper: Teaching GPT3 to express uncertainty in words
Owain_Evans
6mo
7
80
Paper: Discovering novel algorithms with AlphaTensor [Deepmind]
LawrenceC
2mo
18
65
Truthful LMs as a warm-up for aligned AGI
Jacob_Hilton
11mo
14
64
Toy Models of Superposition
evhub
3mo
2
52
Autonomy as taking responsibility for reference maintenance
Ramana Kumar
4mo
3
42
How do new models from OpenAI, DeepMind and Anthropic perform on TruthfulQA?
Owain_Evans
9mo
3
40
A comment on the IDA-AlphaGoZero metaphor; capabilities versus alignment
AlexMennen
4y
1
32
AlphaGo Zero and capability amplification
paulfchristiano
3y
23
30
Finding the variables
Stuart_Armstrong
3y
1
29
The accumulation of knowledge: literature review
Alex Flint
1y
3