From 40d3251a68f8a53358c4df38116d0ba0daaf6dfd Mon Sep 17 00:00:00 2001 From: Nikita Melkozerov Date: Sun, 14 Aug 2022 19:15:20 +0200 Subject: [PATCH] Update additional resources for PG chapter. --- unit5/README.md | 1 + 1 file changed, 1 insertion(+) diff --git a/unit5/README.md b/unit5/README.md index 419b0df..2b50dbe 100644 --- a/unit5/README.md +++ b/unit5/README.md @@ -42,6 +42,7 @@ You can work directly **with the colab notebook, which allows you not to have to ## Additional readings 📚 - [Foundations of Deep RL Series, L3 Policy Gradients and Advantage Estimation by Pieter Abbeel](https://youtu.be/AKbX1Zvo7r8) - [Policy Gradient Algorithms](https://lilianweng.github.io/posts/2018-04-08-policy-gradient/) +- [An Intuitive Explanation of Policy Gradient](https://towardsdatascience.com/an-intuitive-explanation-of-policy-gradient-part-1-reinforce-aa4392cbfd3c) ## How to make the most of this course