mirror of
https://github.com/huggingface/deep-rl-class.git
synced 2026-07-03 11:16:24 +08:00
Update units/en/unit8/additional-readings.mdx
Co-authored-by: Antonin RAFFIN <antonin.raffin@ensta.org>
@@ -13,7 +13,7 @@ These are **optional readings** if you want to go deeper.
## PPO Implementation details
-- [The 37 Implementation Details of Proximal Policy Optimization](https://ppo-details.cleanrl.dev//2021/11/05/ppo-implementation-details/)
+- [The 37 Implementation Details of Proximal Policy Optimization](https://iclr-blog-track.github.io/2022/03/25/ppo-implementation-details/)
- [Part 1 of 3 — Proximal Policy Optimization Implementation: 11 Core Implementation Details](https://www.youtube.com/watch?v=MEt6rrxH8W4)
## Importance Sampling