![RL Course by David Silver - Lecture 5: Model Free Control RL Course by David Silver - Lecture 5: Model Free Control](/sites/default/files/styles/journey_thumbnail/public/2021-06/maxresdefault_16.jpg?itok=AYA3qr-1)
96 min
Intermediate
Video
Theory
Link to External Site
Dives into On Policy Monte-Carlo Control and Temporal Difference Learning, as well as Off-Policy Learning.
Radioactivity
0