Tree of Tags

9 posts Organization Updates

14 posts Redwood Research Adversarial Training AI Robustness

64 What I've been doing instead of writing

benkuhn

1y

3

49 Two clarifications about "Strategic Background"

Rob Bensinger

4y

6

39 Genomic Prediction is now offering embryo selection

gwern

4y

1

55 What's up with Arbital?

Alexei

5y

91

40 Giving What We Can needs your help!

RobertWiblin

7y

6

61 Get genotyped for free (If your IQ is high enough)

David Althaus

11y

63

19 RAISE is looking for full-time content developers

4y

5

43 Help the Brain Preservation Foundation

aurellem

9y

20

9 Symbiosis - An Intentional Community For Radical Self-Improvement

Matt Goldenberg

4y

0

96 Causal Scrubbing: a method for rigorously testing interpretability hypotheses [Redwood Research]

LawrenceC

17d

9

109 Apply to the Redwood Research Mechanistic Interpretability Experiment (REMIX), a research program in Berkeley

maxnadeau

1mo

14

26 Causal scrubbing: results on a paren balance checker

LawrenceC

17d

0

67 Some Lessons Learned from Studying Indirect Object Identification in GPT-2 small

KevinRoWang

1mo

5

127 Takeaways from our robust injury classifier project [Redwood Research]

dmz

3mo

9

14 Causal scrubbing: Appendix

LawrenceC

17d

0

88 High-stakes alignment via adversarial training [Redwood Research report]

dmz

7mo

29

165 Redwood Research’s current project

Buck

1y

29

126 Why I'm excited about Redwood Research's current project

paulfchristiano

1y

6

38 Adversarial training, importance sampling, and anti-adversarial training for AI whistleblowing

Buck

6mo

0

17 AXRP Episode 17 - Training for Very High Reliability with Daniel Ziegler

DanielFilan

4mo

0

52 Redwood's Technique-Focused Epistemic Strategy

adamShimi

1y

1

62 We're Redwood Research, we do applied alignment research, AMA

Nate Thomas

1y

3

17 Latent Adversarial Training

Adam Jermyn

5mo

9