Tree of Tags

Go Back

Choose this branch

You can't go any further

meritocratic regular democratic

hot top alive

15 posts Inverse Reinforcement Learning Road To AI Safety Excellence

35 posts Reinforcement Learning

22 A Survey of Foundational Methods in Inverse Reinforcement Learning

adamk

3mo

0

26 Is CIRL a promising agenda?

Chris_Leong

6mo

12

70 Thoughts on "Human-Compatible"

TurnTrout

3y

35

62 Book Review: Human Compatible

Scott Alexander

2y

6

71 RAISE is launching their MVP

3y

1

36 Learning biases and rewards simultaneously

Rohin Shah

3y

3

37 AI Safety Prerequisites Course: Basic abstract representations of computation

RAISE

3y

2

25 Book review: Human Compatible

PeterMcCluskey

2y

2

29 Our plan for 2019-2020: consulting for AI Safety education

RAISE

3y

17

28 IRL 1/8: Inverse Reinforcement Learning and the problem of degeneracy

RAISE

3y

2

28 Model Mis-specification and Inverse Reinforcement Learning

Owain_Evans

4y

3

16 RAISE AI Safety prerequisites map entirely in one post

RAISE

3y

5

6 Biased reward-learning in CIRL

Stuart_Armstrong

4y

3

2 CIRL Wireheading

tom4everitt

5y

0

286 Reward is not the optimization target

TurnTrout

4mo

97

11 AGIs may value intrinsic rewards more than extrinsic ones

catubc

1mo

6

80 Jitters No Evidence of Stupidity in RL

1a3orn

1y

18

21 RLHF

Ansh Radhakrishnan

7mo

5

29 Scalar reward is not enough for aligned AGI

Peter Vamplew

11mo

3

55 My take on Michael Littman on "The HCI of HAI"

Alex Flint

1y

4

12 Multi-Agent Inverse Reinforcement Learning: Suboptimal Demonstrations and Alternative Solution Concepts

sage_bergerson

1y

0

34 Making a Difference Tempore: Insights from 'Reinforcement Learning: An Introduction'

TurnTrout

4y

6

26 Reinforcement learning with imperceptible rewards

Vanessa Kosoy

3y

1

25 Reinforcement Learning in the Iterated Amplification Framework

William_S

3y

12

17 Evolution as Backstop for Reinforcement Learning: multi-level paradigms

gwern

3y

0

13 Can coherent extrapolated volition be estimated with Inverse Reinforcement Learning?

Jade Bishop

3y

5

6 How is reinforcement learning possible in non-sentient agents?

SomeoneKind

1y

5

8 What messy problems do you see Deep Reinforcement Learning applicable to?

Riccardo Volpato

2y

0