15 posts
Inverse Reinforcement Learning
Road To AI Safety Excellence
35 posts
Reinforcement Learning
24
Is CIRL a promising agenda?
Chris_Leong
6mo
12
10
A Survey of Foundational Methods in Inverse Reinforcement Learning
adamk
3mo
0
92
Book Review: Human Compatible
Scott Alexander
2y
6
56
Thoughts on "Human-Compatible"
TurnTrout
3y
35
49
Book review: Human Compatible
PeterMcCluskey
2y
2
63
RAISE is launching their MVP
3y
1
46
Learning biases and rewards simultaneously
Rohin Shah
3y
3
38
Model Mis-specification and Inverse Reinforcement Learning
Owain_Evans
4y
3
20
RAISE AI Safety prerequisites map entirely in one post
RAISE
3y
5
19
AI Safety Prerequisites Course: Basic abstract representations of computation
RAISE
3y
2
12
IRL 1/8: Inverse Reinforcement Learning and the problem of degeneracy
RAISE
3y
2
7
Our plan for 2019-2020: consulting for AI Safety education
RAISE
3y
17
10
Biased reward-learning in CIRL
Stuart_Armstrong
4y
3
4
CIRL Wireheading
tom4everitt
5y
0
218
Reward is not the optimization target
TurnTrout
4mo
97
5
AGIs may value intrinsic rewards more than extrinsic ones
catubc
1mo
6
84
Jitters No Evidence of Stupidity in RL
1a3orn
1y
18
63
My take on Michael Littman on "The HCI of HAI"
Alex Flint
1y
4
11
RLHF
Ansh Radhakrishnan
7mo
5
26
Reinforcement learning with imperceptible rewards
Vanessa Kosoy
3y
1
32
Making a Difference Tempore: Insights from 'Reinforcement Learning: An Introduction'
TurnTrout
4y
6
25
Reinforcement Learning in the Iterated Amplification Framework
William_S
3y
12
21
Evolution as Backstop for Reinforcement Learning: multi-level paradigms
gwern
3y
0
45
Reinforcement Learning: A Non-Standard Introduction (Part 1)
royf
10y
19
11
Can coherent extrapolated volition be estimated with Inverse Reinforcement Learning?
Jade Bishop
3y
5
26
"Human-level control through deep reinforcement learning" - computer learns 49 different games
skeptical_lurker
7y
19
17
Delegative Inverse Reinforcement Learning
Vanessa Kosoy
5y
0
17
Cooperative Inverse Reinforcement Learning vs. Irrational Human Preferences
orthonormal
6y
0