Tree of Tags

Go Back

Choose this branch

Choose this branch

meritocratic regular democratic

hot top alive

32 posts Machine Learning (ML) OpenAI Lottery Ticket Hypothesis

19 posts DeepMind Truth, Semantics, & Meaning Anthropic Honesty Map and Territory Calibration

35 Reframing inner alignment

davidad

9d

13

21 My thoughts on OpenAI's Alignment plan

Donald Hobson

10d

0

253 Common misconceptions about OpenAI

Jacob_Hilton

3mo

138

97 Survey of NLP Researchers: NLP is contributing to AGI progress; major catastrophe plausible

Sam Bowman

3mo

6

173 A Bird's Eye View of the ML Field [Pragmatic AI Safety #2]

Dan H

7mo

5

45 Paper+Summary: OMNIGROK: GROKKING BEYOND ALGORITHMIC DATA

Marius Hobbhahn

2mo

11

94 Safety Implications of LeCun's path to machine intelligence

Ivan Vendrov

5mo

16

55 A Data limited future

Donald Hobson

4mo

25

47 Steganography in Chain of Thought Reasoning

A Ray

4mo

13

67 The No Free Lunch theorems and their Razor

Adrià Garriga-alonso

7mo

3

17 [MLSN #5]: Prize Compilation

Dan H

2mo

1

169 the scaling “inconsistency”: openAI’s new insight

nostalgebraist

2y

14

24 Train first VS prune first in neural networks.

Donald Hobson

5mo

5

156 Understanding “Deep Double Descent”

evhub

3y

51

307 A challenge for AGI organizations, and a challenge for readers

Rob Bensinger

19d

30

140 Clarifying AI X-risk

zac_kenton

1mo

23

410 DeepMind alignment team opinions on AGI ruin arguments

Vika

4mo

34

104 Caution when interpreting Deepmind's In-context RL paper

Sam Marks

1mo

6

74 Paper: Discovering novel algorithms with AlphaTensor [Deepmind]

LawrenceC

2mo

18

37 Paper: In-context Reinforcement Learning with Algorithm Distillation [Deepmind]

LawrenceC

1mo

5

62 Toy Models of Superposition

evhub

3mo

2

101 Paper: Teaching GPT3 to express uncertainty in words

Owain_Evans

6mo

7

49 Autonomy as taking responsibility for reference maintenance

Ramana Kumar

4mo

3

15 Maps and Blueprint; the Two Sides of the Alignment Equation

Nora_Ammann

1mo

1

68 Truthful LMs as a warm-up for aligned AGI

Jacob_Hilton

11mo

14

49 How do new models from OpenAI, DeepMind and Anthropic perform on TruthfulQA?

Owain_Evans

9mo

3

20 The accumulation of knowledge: literature review

Alex Flint

1y

3

17 Knowledge is not just precipitation of action

Alex Flint

1y

6