15 posts
Inverse Reinforcement Learning
Road To AI Safety Excellence
35 posts
Reinforcement Learning
22
A Survey of Foundational Methods in Inverse Reinforcement Learning
adamk
3mo
0
26
Is CIRL a promising agenda?
Chris_Leong
6mo
12
70
Thoughts on "Human-Compatible"
TurnTrout
3y
35
62
Book Review: Human Compatible
Scott Alexander
2y
6
71
RAISE is launching their MVP
3y
1
36
Learning biases and rewards simultaneously
Rohin Shah
3y
3
37
AI Safety Prerequisites Course: Basic abstract representations of computation
RAISE
3y
2
25
Book review: Human Compatible
PeterMcCluskey
2y
2
29
Our plan for 2019-2020: consulting for AI Safety education
RAISE
3y
17
28
IRL 1/8: Inverse Reinforcement Learning and the problem of degeneracy
RAISE
3y
2
28
Model Mis-specification and Inverse Reinforcement Learning
Owain_Evans
4y
3
16
RAISE AI Safety prerequisites map entirely in one post
RAISE
3y
5
6
Biased reward-learning in CIRL
Stuart_Armstrong
4y
3
2
CIRL Wireheading
tom4everitt
5y
0
286
Reward is not the optimization target
TurnTrout
4mo
97
11
AGIs may value intrinsic rewards more than extrinsic ones
catubc
1mo
6
80
Jitters No Evidence of Stupidity in RL
1a3orn
1y
18
21
RLHF
Ansh Radhakrishnan
7mo
5
29
Scalar reward is not enough for aligned AGI
Peter Vamplew
11mo
3
55
My take on Michael Littman on "The HCI of HAI"
Alex Flint
1y
4
12
Multi-Agent Inverse Reinforcement Learning: Suboptimal Demonstrations and Alternative Solution Concepts
sage_bergerson
1y
0
34
Making a Difference Tempore: Insights from 'Reinforcement Learning: An Introduction'
TurnTrout
4y
6
26
Reinforcement learning with imperceptible rewards
Vanessa Kosoy
3y
1
25
Reinforcement Learning in the Iterated Amplification Framework
William_S
3y
12
17
Evolution as Backstop for Reinforcement Learning: multi-level paradigms
gwern
3y
0
13
Can coherent extrapolated volition be estimated with Inverse Reinforcement Learning?
Jade Bishop
3y
5
6
How is reinforcement learning possible in non-sentient agents?
SomeoneKind
1y
5
8
What messy problems do you see Deep Reinforcement Learning applicable to?
Riccardo Volpato
2y
0