Go Back
Choose this branch
You can't go any further
meritocratic
regular
democratic
hot
top
alive
15 posts
Inverse Reinforcement Learning
Road To AI Safety Excellence
35 posts
Reinforcement Learning
25
Is CIRL a promising agenda?
Chris_Leong
6mo
12
1
(C)IRL is not solely a learning process
Stuart_Armstrong
6y
0
3
CIRL Wireheading
tom4everitt
5y
0
37
Book review: Human Compatible
PeterMcCluskey
2y
2
63
Thoughts on "Human-Compatible"
TurnTrout
3y
35
18
Our plan for 2019-2020: consulting for AI Safety education
RAISE
3y
17
28
AI Safety Prerequisites Course: Basic abstract representations of computation
RAISE
3y
2
67
RAISE is launching their MVP
3y
1
8
Biased reward-learning in CIRL
Stuart_Armstrong
4y
3
20
IRL 1/8: Inverse Reinforcement Learning and the problem of degeneracy
RAISE
3y
2
33
Model Mis-specification and Inverse Reinforcement Learning
Owain_Evans
4y
3
18
RAISE AI Safety prerequisites map entirely in one post
RAISE
3y
5
77
Book Review: Human Compatible
Scott Alexander
2y
6
16
A Survey of Foundational Methods in Inverse Reinforcement Learning
adamk
3mo
0
252
Reward is not the optimization target
TurnTrout
4mo
97
8
AGIs may value intrinsic rewards more than extrinsic ones
catubc
1mo
6
5
What messy problems do you see Deep Reinforcement Learning applicable to?
Riccardo Volpato
2y
0
0
Inverse reinforcement learning on self, pre-ontology-change
Stuart_Armstrong
7y
0
4
Some work on connecting UDT and Reinforcement Learning
IAFF-User-111
7y
0
14
Cooperative Inverse Reinforcement Learning vs. Irrational Human Preferences
orthonormal
6y
0
4
Modeling the capabilities of advanced AI systems as episodic reinforcement learning
jessicata
6y
0
2
Vector-Valued Reinforcement Learning
orthonormal
6y
0
0
Reward/value learning for reinforcement learning
Stuart_Armstrong
5y
0
15
Delegative Inverse Reinforcement Learning
Vanessa Kosoy
5y
0
1
Delegative Reinforcement Learning with a Merely Sane Advisor
Vanessa Kosoy
5y
2
13
Clarification: Behaviourism & Reinforcement
Zaine
10y
30
33
Making a Difference Tempore: Insights from 'Reinforcement Learning: An Introduction'
TurnTrout
4y
6
12
Can coherent extrapolated volition be estimated with Inverse Reinforcement Learning?
Jade Bishop
3y
5