Tree of Tags

32 posts: Machine Learning (ML); OpenAI; Lottery Ticket Hypothesis

19 posts: DeepMind; Truth, Semantics, & Meaning; Anthropic; Honesty; Map and Territory; Calibration

253 Common misconceptions about OpenAI

Jacob_Hilton

3mo

138

173 A Bird's Eye View of the ML Field [Pragmatic AI Safety #2]

Dan H

7mo

5

169 the scaling “inconsistency”: openAI’s new insight

nostalgebraist

2y

14

156 Understanding “Deep Double Descent”

evhub

3y

51

97 Survey of NLP Researchers: NLP is contributing to AGI progress; major catastrophe plausible

Sam Bowman

3mo

6

94 Safety Implications of LeCun's path to machine intelligence

Ivan Vendrov

5mo

16

71 Tabooing 'Agent' for Prosaic Alignment

Hjalmar_Wijk

3y

10

67 The No Free Lunch theorems and their Razor

Adrià Garriga-alonso

7mo

3

60 Gradations of Inner Alignment Obstacles

abramdemski

1y

22

60 Using GPT-N to Solve Interpretability of Neural Networks: A Research Agenda

Logan Riggs

2y

12

56 Inductive biases stick around

evhub

3y

14

55 A Data limited future

Donald Hobson

4mo

25

54 Unsolved ML Safety Problems

jsteinhardt

1y

2

52 Multimodal Neurons in Artificial Neural Networks

Kaj_Sotala

1y

2

410 DeepMind alignment team opinions on AGI ruin arguments

Vika

4mo

34

307 A challenge for AGI organizations, and a challenge for readers

Rob Bensinger

19d

30

140 Clarifying AI X-risk

zac_kenton

1mo

23

104 Caution when interpreting Deepmind's In-context RL paper

Sam Marks

1mo

6

101 Paper: Teaching GPT3 to express uncertainty in words

Owain_Evans

6mo

7

74 Paper: Discovering novel algorithms with AlphaTensor [Deepmind]

LawrenceC

2mo

18

68 Truthful LMs as a warm-up for aligned AGI

Jacob_Hilton

11mo

14

62 Toy Models of Superposition

evhub

3mo

2

49 Autonomy as taking responsibility for reference maintenance

Ramana Kumar

4mo

3

49 How do new models from OpenAI, DeepMind and Anthropic perform on TruthfulQA?

Owain_Evans

9mo

3

37 Paper: In-context Reinforcement Learning with Algorithm Distillation [Deepmind]

LawrenceC

1mo

5

32 A comment on the IDA-AlphaGoZero metaphor; capabilities versus alignment

AlexMennen

4y

1

21 Bridging syntax and semantics, empirically

Stuart_Armstrong

4y

4

21 Finding the variables

Stuart_Armstrong

3y

1