mirror of
https://github.com/huggingface/deep-rl-class.git
synced 2026-04-01 09:40:26 +08:00
Update Unit 3
This commit is contained in:
@@ -1,5 +1,15 @@
|
||||
{
|
||||
"cells": [
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {
|
||||
"id": "view-in-github",
|
||||
"colab_type": "text"
|
||||
},
|
||||
"source": [
|
||||
"<a href=\"https://colab.research.google.com/github/huggingface/deep-rl-class/blob/ThomasSimonini%2FUnit3/notebooks/unit3/unit3.ipynb\" target=\"_parent\"><img src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/></a>"
|
||||
]
|
||||
},
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"metadata": {
|
||||
@@ -32,15 +42,12 @@
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"source": [
|
||||
"TODO: ADD TEXT LIVE INFO\n",
|
||||
"### 🎮 Environments: \n",
|
||||
"\n",
|
||||
"TODO: ADD IF YOU HAVE QUESTIONS\n",
|
||||
"\n",
|
||||
"\n",
|
||||
"###🎮 Environments: \n",
|
||||
"- SpacesInvadersNoFrameskip-v4 \n",
|
||||
"\n",
|
||||
"###📚 RL-Library: \n",
|
||||
"### 📚 RL-Library: \n",
|
||||
"\n",
|
||||
"- [RL-Baselines3-Zoo](https://github.com/DLR-RM/rl-baselines3-zoo)"
|
||||
],
|
||||
"metadata": {
|
||||
@@ -100,7 +107,7 @@
|
||||
"## Prerequisites 🏗️\n",
|
||||
"Before diving into the notebook, you need to:\n",
|
||||
"\n",
|
||||
"🔲 📚 **Study Deep Q-Learning by reading Unit 3** 🤗 ADD LINK "
|
||||
"🔲 📚 **[Study Deep Q-Learning by reading Unit 3](https://huggingface.co/deep-rl-course/unit3/introduction)** 🤗 "
|
||||
]
|
||||
},
|
||||
{
|
||||
@@ -118,7 +125,13 @@
|
||||
"id": "QR0jZtYreSI5"
|
||||
},
|
||||
"source": [
|
||||
"# Let's train a Deep Q-Learning agent playing Atari' Space Invaders 👾 and upload it to the Hub."
|
||||
"# Let's train a Deep Q-Learning agent playing Atari' Space Invaders 👾 and upload it to the Hub.\n",
|
||||
"\n",
|
||||
"To validate this hands-on for the certification process, you need to push your trained model to the Hub and **get a result of >= 500**.\n",
|
||||
"\n",
|
||||
"To find your result, go to the leaderboard and find your model, **the result = mean_reward - std of reward**\n",
|
||||
"\n",
|
||||
"For more information about the certification process, check this section 👉 https://huggingface.co/deep-rl-course/en/unit0/introduction#certification-process"
|
||||
]
|
||||
},
|
||||
{
|
||||
@@ -719,7 +732,7 @@
|
||||
{
|
||||
"cell_type": "markdown",
|
||||
"source": [
|
||||
"See you on [Bonus unit 2](https://github.com/huggingface/deep-rl-class/tree/main/unit2#unit-2-introduction-to-q-learning)! 🔥 TODO CHANGE LINK"
|
||||
"See you on Bonus unit 2! 🔥 "
|
||||
],
|
||||
"metadata": {
|
||||
"id": "Kc3udPT-RcXc"
|
||||
@@ -738,7 +751,8 @@
|
||||
"metadata": {
|
||||
"colab": {
|
||||
"private_outputs": true,
|
||||
"provenance": []
|
||||
"provenance": [],
|
||||
"include_colab_link": true
|
||||
},
|
||||
"kernelspec": {
|
||||
"display_name": "Python 3 (ipykernel)",
|
||||
|
||||
Reference in New Issue
Block a user