diff --git a/units/en/unit2/hands-on.mdx b/units/en/unit2/hands-on.mdx index c189dd9..6661341 100644 --- a/units/en/unit2/hands-on.mdx +++ b/units/en/unit2/hands-on.mdx @@ -353,7 +353,7 @@ def greedy_policy(Qtable, state): return action ``` -##Define the epsilon-greedy policy 🤖 +## Define the epsilon-greedy policy 🤖 Epsilon-greedy is the training policy that handles the exploration/exploitation trade-off. diff --git a/units/en/unitbonus1/train.mdx b/units/en/unitbonus1/train.mdx index 1a2df6a..235606e 100644 --- a/units/en/unitbonus1/train.mdx +++ b/units/en/unitbonus1/train.mdx @@ -26,12 +26,7 @@ By using Google Colab, **you can focus on learning and experimenting without wor In this notebook, we'll reinforce what we learned in the first Unit by **teaching Huggy the Dog to fetch the stick and then play with it directly in your browser** -⬇️ Here is an example of what **you will achieve at the end of the unit.** ⬇️ (launch ▶ to see) - -```python -%%html - -``` +Huggy ### The environment 🎮