mirror of
https://github.com/huggingface/deep-rl-class.git
synced 2026-05-12 02:46:37 +08:00
Update two-methods.mdx
This commit is contained in:
@@ -54,7 +54,7 @@ We have two types of policies:
|
||||
</figure>
|
||||
|
||||
<figure>
|
||||
<img src="https://huggingface.co/datasets/huggingface-deep-rl-course/course-images/resolve/main/en/unit1/mario.jpg" alt="Mario"/>
|
||||
<img src="https://huggingface.co/datasets/huggingface-deep-rl-course/course-images/resolve/main/en/unit1/policy-based.png" alt="Policy Based"/>
|
||||
<figcaption>Given an initial state, our stochastic policy will output probability distributions over the possible actions at that state.</figcaption>
|
||||
</figure>
|
||||
|
||||
|
||||
Reference in New Issue
Block a user