Tree of Tags

Go Back

Choose this branch

You can't go any further

meritocratic regular democratic

hot top alive

15 posts Inverse Reinforcement Learning Road To AI Safety Excellence

35 posts Reinforcement Learning

25 Is CIRL a promising agenda?

Chris_Leong

6mo

12

1 (C)IRL is not solely a learning process

Stuart_Armstrong

6y

0

3 CIRL Wireheading

tom4everitt

5y

0

37 Book review: Human Compatible

PeterMcCluskey

2y

2

63 Thoughts on "Human-Compatible"

TurnTrout

3y

35

18 Our plan for 2019-2020: consulting for AI Safety education

RAISE

3y

17

28 AI Safety Prerequisites Course: Basic abstract representations of computation

RAISE

3y

2

67 RAISE is launching their MVP

3y

1

8 Biased reward-learning in CIRL

Stuart_Armstrong

4y

3

20 IRL 1/8: Inverse Reinforcement Learning and the problem of degeneracy

RAISE

3y

2

33 Model Mis-specification and Inverse Reinforcement Learning

Owain_Evans

4y

3

18 RAISE AI Safety prerequisites map entirely in one post

RAISE

3y

5

77 Book Review: Human Compatible

Scott Alexander

2y

6

16 A Survey of Foundational Methods in Inverse Reinforcement Learning

adamk

3mo

0

252 Reward is not the optimization target

TurnTrout

4mo

97

8 AGIs may value intrinsic rewards more than extrinsic ones

catubc

1mo

6

5 What messy problems do you see Deep Reinforcement Learning applicable to?

Riccardo Volpato

2y

0

0 Inverse reinforcement learning on self, pre-ontology-change

Stuart_Armstrong

7y

0

4 Some work on connecting UDT and Reinforcement Learning

IAFF-User-111

7y

0

14 Cooperative Inverse Reinforcement Learning vs. Irrational Human Preferences

orthonormal

6y

0

4 Modeling the capabilities of advanced AI systems as episodic reinforcement learning

jessicata

6y

0

2 Vector-Valued Reinforcement Learning

orthonormal

6y

0

0 Reward/value learning for reinforcement learning

Stuart_Armstrong

5y

0

15 Delegative Inverse Reinforcement Learning

Vanessa Kosoy

5y

0

1 Delegative Reinforcement Learning with a Merely Sane Advisor

Vanessa Kosoy

5y

2

13 Clarification: Behaviourism & Reinforcement

Zaine

10y

30

33 Making a Difference Tempore: Insights from 'Reinforcement Learning: An Introduction'

TurnTrout

4y

6

12 Can coherent extrapolated volition be estimated with Inverse Reinforcement Learning?

Jade Bishop

3y

5