Mirror of https://github.com/huggingface/deep-rl-class.git (synced 2026-04-03 10:38:27 +08:00)
Update additional resources for PG chapter.
Committed by GitHub
Parent: ffd19903dc
Commit: 40d3251a68
@@ -42,6 +42,7 @@ You can work directly **with the colab notebook, which allows you not to have to
## Additional readings 📚
- [Foundations of Deep RL Series, L3 Policy Gradients and Advantage Estimation by Pieter Abbeel](https://youtu.be/AKbX1Zvo7r8)
- [Policy Gradient Algorithms](https://lilianweng.github.io/posts/2018-04-08-policy-gradient/)
- [An Intuitive Explanation of Policy Gradient](https://towardsdatascience.com/an-intuitive-explanation-of-policy-gradient-part-1-reinforce-aa4392cbfd3c)
## How to make the most of this course