Tree of Tags

Go Back

Choose this branch

Choose this branch

meritocratic regular democratic

hot top alive

32 posts Machine Learning (ML) OpenAI Lottery Ticket Hypothesis

19 posts DeepMind Truth, Semantics, & Meaning Anthropic Honesty Map and Territory Calibration

59 Reframing inner alignment

davidad

9d

13

19 My thoughts on OpenAI's Alignment plan

Donald Hobson

10d

0

199 Common misconceptions about OpenAI

Jacob_Hilton

3mo

138

81 Survey of NLP Researchers: NLP is contributing to AGI progress; major catastrophe plausible

Sam Bowman

3mo

6

43 Paper+Summary: OMNIGROK: GROKKING BEYOND ALGORITHMIC DATA

Marius Hobbhahn

2mo

11

84 Safety Implications of LeCun's path to machine intelligence

Ivan Vendrov

5mo

16

51 Steganography in Chain of Thought Reasoning

A Ray

4mo

13

49 A Data limited future

Donald Hobson

4mo

25

77 A Bird's Eye View of the ML Field [Pragmatic AI Safety #2]

Dan H

7mo

5

11 [MLSN #5]: Prize Compilation

Dan H

2mo

1

100 Gradations of Inner Alignment Obstacles

abramdemski

1y

22

123 the scaling “inconsistency”: openAI’s new insight

nostalgebraist

2y

14

27 The No Free Lunch theorems and their Razor

Adrià Garriga-alonso

7mo

3

60 Unsolved ML Safety Problems

jsteinhardt

1y

2

223 A challenge for AGI organizations, and a challenge for readers

Rob Bensinger

19d

30

318 DeepMind alignment team opinions on AGI ruin arguments

Vika

4mo

34

104 Caution when interpreting Deepmind's In-context RL paper

Sam Marks

1mo

6

64 Clarifying AI X-risk

zac_kenton

1mo

23

86 Paper: Discovering novel algorithms with AlphaTensor [Deepmind]

LawrenceC

2mo

18

66 Toy Models of Superposition

evhub

3mo

2

27 Maps and Blueprint; the Two Sides of the Alignment Equation

Nora_Ammann

1mo

1

55 Autonomy as taking responsibility for reference maintenance

Ramana Kumar

4mo

3

91 Paper: Teaching GPT3 to express uncertainty in words

Owain_Evans

6mo

7

19 Paper: In-context Reinforcement Learning with Algorithm Distillation [Deepmind]

LawrenceC

1mo

5

62 Truthful LMs as a warm-up for aligned AGI

Jacob_Hilton

11mo

14

35 How do new models from OpenAI, DeepMind and Anthropic perform on TruthfulQA?

Owain_Evans

9mo

3

38 The accumulation of knowledge: literature review

Alex Flint

1y

3

25 Knowledge is not just precipitation of action

Alex Flint

1y

6