mirror of
https://github.com/huggingface/deep-rl-class.git
synced 2026-04-13 16:49:44 +08:00
Merge pull request #388 from huggingface/ThomasSimonini/Update
Small Update
This commit is contained in:
@@ -353,7 +353,7 @@ def greedy_policy(Qtable, state):
|
||||
return action
|
||||
```
|
||||
|
||||
##Define the epsilon-greedy policy 🤖
|
||||
## Define the epsilon-greedy policy 🤖
|
||||
|
||||
Epsilon-greedy is the training policy that handles the exploration/exploitation trade-off.
|
||||
|
||||
|
||||
@@ -26,12 +26,7 @@ By using Google Colab, **you can focus on learning and experimenting without wor
|
||||
|
||||
In this notebook, we'll reinforce what we learned in the first Unit by **teaching Huggy the Dog to fetch the stick and then play with it directly in your browser**
|
||||
|
||||
⬇️ Here is an example of what **you will achieve at the end of the unit.** ⬇️ (launch ▶ to see)
|
||||
|
||||
```python
|
||||
%%html
|
||||
<video controls autoplay><source src="https://huggingface.co/datasets/huggingface-deep-rl-course/course-images/resolve/main/en/notebooks/unit-bonus1/huggy.mp4" type="video/mp4"></video>
|
||||
```
|
||||
<img src="https://huggingface.co/datasets/huggingface-deep-rl-course/course-images/resolve/main/en/notebooks/unit-bonus1/huggy.jpg" alt="Huggy"/>
|
||||
|
||||
### The environment 🎮
|
||||
|
||||
|
||||
Reference in New Issue
Block a user