Merge pull request #312 from huggingface/GymnasiumUpdate/Unit4

Update Unit 4
2026-06-15 14:36:45 +08:00 · 2023-05-03 17:39:51 +02:00
parent 152094b806 1f0edd7ddd
commit 3622474759
4 changed files with 1644 additions and 570 deletions
--- a/notebooks/unit4/unit4.ipynb
+++ b/notebooks/unit4/unit4.ipynb
@@ -7,7 +7,7 @@
        "colab_type": "text"
      },
      "source": [
-        "<a href=\"https://colab.research.google.com/github/huggingface/deep-rl-class/blob/ThomasSimonini%2FPG/notebooks/unit4/unit4.ipynb\" target=\"_parent\"><img src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/></a>"
+        "<a href=\"https://colab.research.google.com/github/huggingface/deep-rl-class/blob/GymnasiumUpdate%2FUnit4/notebooks/unit4/unit4.ipynb\" target=\"_parent\"><img src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/></a>"
      ]
    },
    {
@@ -229,6 +229,15 @@
        "- `gym-games`: Extra gym environments made with PyGame.\n",
        "- `huggingface_hub`: 🤗 works as a central place where anyone can share and explore models and datasets. It has versioning, metrics, visualizations, and other features that will allow you to easily collaborate with others.\n",
        "\n",
+        "You may be wondering why we install gym and not gymnasium, a more recent version of gym. **Because the gym-games environments we are using have not yet been updated for gymnasium**.\n",
+        "\n",
+        "The differences you'll encounter here:\n",
+        "- In `gym`, there is no `terminated` or `truncated`, only `done`.\n",
+        "- In `gym`, `env.step()` returns `state, reward, done, info`, whereas in Gymnasium it returns `state, reward, terminated, truncated, info`.\n",
+        "\n",
+        "You can learn more about the differences between Gym and Gymnasium here 👉 https://gymnasium.farama.org/content/migration-guide/\n",
+        "\n",
+        "\n",
        "You can see here all the Reinforce models available 👉 https://huggingface.co/models?other=reinforce\n",
        "\n",
        "And you can find all the Deep Reinforcement Learning models here 👉 https://huggingface.co/models?pipeline_tag=reinforcement-learning\n"
--- a/unit4.ipynb
+++ b/unit4.ipynb
--- a/unit4/unit4.ipynb
+++ b/unit4/unit4.ipynb
--- a/units/en/unit4/hands-on.mdx
+++ b/units/en/unit4/hands-on.mdx
@@ -37,6 +37,9 @@ And you can check your progress here 👉 https://huggingface.co/spaces/ThomasSi

 [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/huggingface/deep-rl-class/blob/master/notebooks/unit4/unit4.ipynb)

+We strongly **recommend students use Google Colab for the hands-on exercises** instead of running them on their personal computers. 
+
+By using Google Colab, **you can focus on learning and experimenting without worrying about the technical aspects** of setting up your environments.

 # Unit 4: Code your first Deep Reinforcement Learning Algorithm with PyTorch: Reinforce. And test its robustness 💪

@@ -134,6 +137,14 @@ The first step is to install the dependencies. We’ll install multiple ones:
 - `gym-games`: Extra gym environments made with PyGame.
 - `huggingface_hub`: The Hub works as a central place where anyone can share and explore models and datasets. It has versioning, metrics, visualizations, and other features that will allow you to easily collaborate with others.

+You may be wondering why we install gym and not gymnasium, a more recent version of gym. **Because the gym-games environments we are using have not yet been updated for gymnasium**.
+
+The differences you'll encounter here:
+- In `gym`, there is no `terminated` or `truncated`, only `done`.
+- In `gym`, `env.step()` returns `state, reward, done, info`, whereas in Gymnasium it returns `state, reward, terminated, truncated, info`.
+
+You can learn more about the differences between Gym and Gymnasium here 👉 https://gymnasium.farama.org/content/migration-guide/
+
 You can see here all the Reinforce models available 👉 https://huggingface.co/models?other=reinforce

 And you can find all the Deep Reinforcement Learning models here 👉 https://huggingface.co/models?pipeline_tag=reinforcement-learning
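
The `gym` versus Gymnasium difference described in the added lines above can be sketched as follows. This is a minimal illustration, not code from the commit: `normalize_step` is a hypothetical helper name, and the tuples are stand-ins for what `env.step()` might return, so the snippet runs without `gym` or `gymnasium` installed. It assumes the two return shapes named in the notebook text and treats an episode as `done` when it is either terminated or truncated.

```python
from typing import Any, Dict, Tuple


def normalize_step(result: Tuple[Any, ...]) -> Tuple[Any, float, bool, Dict[str, Any]]:
    """Hypothetical helper: map either step() return shape onto (state, reward, done, info)."""
    if len(result) == 4:
        # Old `gym` style: state, reward, done, info
        state, reward, done, info = result
    elif len(result) == 5:
        # Gymnasium style: state, reward, terminated, truncated, info
        state, reward, terminated, truncated, info = result
        # Assumption: a single `done` flag is true if the episode ended for either reason.
        done = terminated or truncated
    else:
        raise ValueError(f"unexpected step() result of length {len(result)}")
    return state, float(reward), bool(done), info


# Stand-in tuples (not produced by a real environment).
old_style = ([0.1, 0.2], 1.0, False, {})
new_style = ([0.1, 0.2], 1.0, False, True, {})

print(normalize_step(old_style))
print(normalize_step(new_style))
```

With a helper like this, a training loop written against the single `done` flag would keep working if the environment were later switched to the five-value return, at the cost of no longer distinguishing termination from truncation.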