l4 trpo and ppo (foundations of deep rl series)
Published 2 years ago • 27K plays • Length 25:21Download video MP4
Download video MP3
Similar videos
-
41:01
deep rl bootcamp lecture 5: natural policy gradients, trpo, ppo
-
41:22
l3 policy gradients and advantage estimation (foundations of deep rl series)
-
1:16:10
l1 mdps, exact solution methods, max-ent rl (foundations of deep rl series)
-
18:14
l6 model-based rl (foundations of deep rl series)
-
11:05
ai learns to park - deep reinforcement learning
-
3:19
deep learning cars
-
18:14
cs885 lecture 15b: proximal policy optimization (presenter: ruifan yu)
-
19:50
an introduction to policy gradient methods - deep reinforcement learning
-
17:50
proximal policy optimization explained
-
12:12
l5 ddpg and sac (foundations of deep rl series)
-
34:09
l2 deep q-learning (foundations of deep rl series)
-
13:18
deep policy search class: trpo and ppo
-
11:05
trpo and acktr (rlvs 2021 version)