Tree of Tags

32 posts: Machine Learning (ML); OpenAI; Lottery Ticket Hypothesis

19 posts: DeepMind; Truth, Semantics, & Meaning; Anthropic; Honesty; Map and Territory; Calibration

226 Common misconceptions about OpenAI

Jacob_Hilton

3mo

138

146 the scaling “inconsistency”: openAI’s new insight

nostalgebraist

2y

14

135 Understanding “Deep Double Descent”

evhub

3y

51

125 A Bird's Eye View of the ML Field [Pragmatic AI Safety #2]

Dan H

7mo

5

89 Survey of NLP Researchers: NLP is contributing to AGI progress; major catastrophe plausible

Sam Bowman

3mo

6

89 Safety Implications of LeCun's path to machine intelligence

Ivan Vendrov

5mo

16

80 Gradations of Inner Alignment Obstacles

abramdemski

1y

22

67 Using GPT-N to Solve Interpretability of Neural Networks: A Research Agenda

Logan Riggs

2y

12

63 Inductive biases stick around

evhub

3y

14

60 SGD's Bias

johnswentworth

1y

16

57 Unsolved ML Safety Problems

jsteinhardt

1y

2

57 Multimodal Neurons in Artificial Neural Networks

Kaj_Sotala

1y

2

54 Tabooing 'Agent' for Prosaic Alignment

Hjalmar_Wijk

3y

10

52 A Data limited future

Donald Hobson

4mo

25

364 DeepMind alignment team opinions on AGI ruin arguments

Vika

4mo

34

265 A challenge for AGI organizations, and a challenge for readers

Rob Bensinger

19d

30

104 Caution when interpreting Deepmind's In-context RL paper

Sam Marks

1mo

6

102 Clarifying AI X-risk

zac_kenton

1mo

23

96 Paper: Teaching GPT3 to express uncertainty in words

Owain_Evans

6mo

7

80 Paper: Discovering novel algorithms with AlphaTensor [Deepmind]

LawrenceC

2mo

18

65 Truthful LMs as a warm-up for aligned AGI

Jacob_Hilton

11mo

14

64 Toy Models of Superposition

evhub

3mo

2

52 Autonomy as taking responsibility for reference maintenance

Ramana Kumar

4mo

3

42 How do new models from OpenAI, DeepMind and Anthropic perform on TruthfulQA?

Owain_Evans

9mo

3

40 A comment on the IDA-AlphaGoZero metaphor; capabilities versus alignment

AlexMennen

4y

1

32 AlphaGo Zero and capability amplification

paulfchristiano

3y

23

30 Finding the variables

Stuart_Armstrong

3y

1

29 The accumulation of knowledge: literature review

Alex Flint

1y

3