Balaji Varatharajan 87fcfeb9bb Update variance-problem.mdx
Hi, I have a blog post titled [High Variance in Policy gradients](https://balajiai.github.io/high_variance_in_policy_gradients) that also explains the variance problem in policy gradients and variance-reduction techniques such as baselines and actor-critic methods.
I think it would be valuable to this course's readers, so I'm adding it to the reading list.

Thanks!
2024-02-17 15:16:29 +05:30

The Hugging Face Deep Reinforcement Learning Course 🤗 (v2.0)

If you like the course, don't hesitate to star this repository. This helps us 🤗.

This repository contains the Deep Reinforcement Learning Course mdx files and notebooks. The website is here: https://huggingface.co/deep-rl-course/unit0/introduction?fw=pt

Citing the project

To cite this repository in publications:

@misc{deep-rl-course,
  author = {Simonini, Thomas and Sanseviero, Omar},
  title = {The Hugging Face Deep Reinforcement Learning Class},
  year = {2023},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/huggingface/deep-rl-class}},
}
Description
This repo contains the syllabus of the Hugging Face Deep Reinforcement Learning Class.
Readme 55 MiB
Languages
MDX 58.7%
Jupyter Notebook 41.3%