mirror of
https://github.com/huggingface/deep-rl-class.git
synced 2026-02-03 02:14:53 +08:00
Update SnowballTarget
This commit is contained in:
@@ -553,11 +553,9 @@
|
||||
"\n",
|
||||
"For this step it’s simple:\n",
|
||||
"\n",
|
||||
"1. Remember your repo-id\n",
|
||||
"1. Go here: https://huggingface.co/spaces/ThomasSimonini/ML-Agents-SnowballTarget\n",
|
||||
"\n",
|
||||
"2. Go here: https://huggingface.co/spaces/ThomasSimonini/ML-Agents-SnowballTarget\n",
|
||||
"\n",
|
||||
"3. Launch the game and put it in full screen by clicking on the bottom right button\n",
|
||||
"2. Launch the game and put it in full screen by clicking on the bottom right button\n",
|
||||
"\n",
|
||||
"<img src=\"https://huggingface.co/datasets/huggingface-deep-rl-course/course-images/resolve/main/en/unit7/snowballtarget_load.png\" alt=\"Snowballtarget load\"/>"
|
||||
],
|
||||
@@ -565,14 +563,16 @@
|
||||
"id": "VMc4oOsE0QiZ"
|
||||
}
|
||||
},
|
||||
{
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"source": [
|
||||
"1. In step 1, choose your model repository which is the model id (in my case ThomasSimonini/ppo-SnowballTarget).\n",
|
||||
"1. In step 1, type your username (your username is case sensitive: for instance, my username is ThomasSimonini not thomassimonini or ThOmasImoNInI) and click on the search button.",
|
||||
"\n",
|
||||
"2. In step 2, **choose what model you want to replay**:\n",
|
||||
" - I have multiple one, since we saved a model every 500000 timesteps.\n",
|
||||
" - But if I want the more recent I choose `SnowballTarget.onnx`\n",
|
||||
"2. In step 2, select your model repository.",
|
||||
"\n",
|
||||
"3. In step 3, **choose which model you want to replay**:",
|
||||
" - I have multiple ones, since we saved a model every 500000 timesteps.\n",
|
||||
" - But since I want the more recent, I choose `SnowballTarget.onnx`\n",
|
||||
"\n",
|
||||
"👉 What’s nice **is to try with different models step to see the improvement of the agent.**\n",
|
||||
"\n",
|
||||
|
||||
@@ -261,21 +261,22 @@ But now comes the best: **being able to visualize your agent online 👀.**
|
||||
|
||||
This step it's simple:
|
||||
|
||||
1. Remember your repo-id
|
||||
1. Go here: https://huggingface.co/spaces/ThomasSimonini/ML-Agents-SnowballTarget
|
||||
|
||||
2. Go here: https://huggingface.co/spaces/ThomasSimonini/ML-Agents-SnowballTarget
|
||||
|
||||
3. Launch the game and put it in full screen by clicking on the bottom right button
|
||||
2. Launch the game and put it in full screen by clicking on the bottom right button
|
||||
|
||||
<img src="https://huggingface.co/datasets/huggingface-deep-rl-course/course-images/resolve/main/en/unit7/snowballtarget_load.png" alt="Snowballtarget load"/>
|
||||
|
||||
1. In step 1, choose your model repository, which is the model id (in my case ThomasSimonini/ppo-SnowballTarget).
|
||||
1. In step 1, type your username (your username is case sensitive: for instance, my username is ThomasSimonini not thomassimonini or ThOmasImoNInI) and click on the search button.
|
||||
|
||||
2. In step 2, **choose what model you want to replay**:
|
||||
- I have multiple ones since we saved a model every 500000 timesteps.
|
||||
- But if I want the more recent I choose `SnowballTarget.onnx`
|
||||
2. In step 2, select your model repository.
|
||||
|
||||
3. In step 3, **choose which model you want to replay**:
|
||||
- I have multiple ones, since we saved a model every 500000 timesteps.
|
||||
- But since I want the most recent one, I choose `SnowballTarget.onnx`
|
||||
|
||||
👉 It's good **to try with different models steps to see the improvement of the agent.**
|
||||
|
||||
👉 It's nice to **try different model stages to see the improvement of the agent.**
|
||||
|
||||
And don't hesitate to share the best score your agent gets on discord in the #rl-i-made-this channel 🔥
|
||||
|
||||
|
||||
Reference in New Issue
Block a user