diff --git a/units/en/unit8/additional-readings.mdx b/units/en/unit8/additional-readings.mdx index 7425293..89196f9 100644 --- a/units/en/unit8/additional-readings.mdx +++ b/units/en/unit8/additional-readings.mdx @@ -13,7 +13,7 @@ These are **optional readings** if you want to go deeper. ## PPO Implementation details -- [The 37 Implementation Details of Proximal Policy Optimization](https://ppo-details.cleanrl.dev//2021/11/05/ppo-implementation-details/) +- [The 37 Implementation Details of Proximal Policy Optimization](https://iclr-blog-track.github.io/2022/03/25/ppo-implementation-details/) - [Part 1 of 3 — Proximal Policy Optimization Implementation: 11 Core Implementation Details](https://www.youtube.com/watch?v=MEt6rrxH8W4) ## Importance Sampling