mirror of
https://github.com/huggingface/deep-rl-class.git
synced 2026-02-02 18:09:24 +08:00
Hi, I've a blog titled [High Variance in Policy gradients](https://balajiai.github.io/high_variance_in_policy_gradients) which also explains about the variance problem in policy gradient and techniques for variance reduction such as baseline and actor-critics method. I think, it would be valuable to this course readers. So I'm adding it to the reading-list. Thanks!