Update units/en/unit8/additional-readings.mdx

Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
This commit is contained in:
Thomas Simonini
2023-01-04 11:15:35 +01:00
committed by GitHub
parent fc4b52d138
commit 935da988bc

View File

@@ -13,7 +13,7 @@ These are **optional readings** if you want to go deeper.
## PPO Implementation details
- [The 37 Implementation Details of Proximal Policy Optimization](https://ppo-details.cleanrl.dev//2021/11/05/ppo-implementation-details/)
- [The 37 Implementation Details of Proximal Policy Optimization](https://iclr-blog-track.github.io/2022/03/25/ppo-implementation-details/)
- [Part 1 of 3 — Proximal Policy Optimization Implementation: 11 Core Implementation Details](https://www.youtube.com/watch?v=MEt6rrxH8W4)
## Importance Sampling