From 4be6adb7f48f0834919df23713eff063e48588b3 Mon Sep 17 00:00:00 2001
From: Thomas Simonini
Date: Tue, 28 Mar 2023 15:45:22 +0200
Subject: [PATCH 1/2] Update Course

---
 notebooks/unit3/unit3.ipynb  | 1 +
 units/en/unit3/hands-on.mdx  | 3 +++
 units/en/unit7/self-play.mdx | 2 +-
 3 files changed, 5 insertions(+), 1 deletion(-)

diff --git a/notebooks/unit3/unit3.ipynb b/notebooks/unit3/unit3.ipynb
index 1c4b17b..8587f14 100644
--- a/notebooks/unit3/unit3.ipynb
+++ b/notebooks/unit3/unit3.ipynb
@@ -289,6 +289,7 @@
    },
    "outputs": [],
    "source": [
+    "pip install setuptools==65.5.0\n",
     "!pip install -r requirements.txt\n",
     "# Since colab uses Python 3.9 we need to add this installation\n",
     "!pip install gym[atari,accept-rom-license]==0.21.0"
diff --git a/units/en/unit3/hands-on.mdx b/units/en/unit3/hands-on.mdx
index 409d410..f0a7d4e 100644
--- a/units/en/unit3/hands-on.mdx
+++ b/units/en/unit3/hands-on.mdx
@@ -127,7 +127,10 @@ cd /content/rl-baselines3-zoo/
 ```

 ```bash
+pip install setuptools==65.5.0
 pip install -r requirements.txt
+# Since colab uses Python 3.9 we need to add this installation
+pip install gym[atari,accept-rom-license]==0.21.0
 ```

 ## Train our Deep Q-Learning Agent to Play Space Invaders 👾
diff --git a/units/en/unit7/self-play.mdx b/units/en/unit7/self-play.mdx
index 8dd44c7..347695d 100644
--- a/units/en/unit7/self-play.mdx
+++ b/units/en/unit7/self-play.mdx
@@ -31,7 +31,7 @@ We do the same with self-play:
 - We **start with a copy of our agent as an opponent** this way, this opponent is on a similar level.
 - We **learn from it**, and when we acquire some skills, we **update our opponent with a more recent copy of our training policy**.

-The theory behind self-play is not something new. It was already used by Arthur Samuel’s checker player system in the fifties and by Gerald Tesauro’s TD-Gammon in 1955. If you want to learn more about the history of self-play [check this very good blogpost by Andrew Cohen](https://blog.unity.com/technology/training-intelligent-adversaries-using-self-play-with-ml-agents)
+The theory behind self-play is not something new. It was already used by Arthur Samuel’s checker player system in the fifties and by Gerald Tesauro’s TD-Gammon in 1995. If you want to learn more about the history of self-play [check this very good blogpost by Andrew Cohen](https://blog.unity.com/technology/training-intelligent-adversaries-using-self-play-with-ml-agents)

 ## Self-Play in MLAgents


From d638afb028696df21bfc51043338cedb020fac75 Mon Sep 17 00:00:00 2001
From: Thomas Simonini
Date: Tue, 28 Mar 2023 15:55:57 +0200
Subject: [PATCH 2/2] Change deadline From April to June

---
 units/en/unit0/introduction.mdx | 10 +++++-----
 1 file changed, 5 insertions(+), 5 deletions(-)

diff --git a/units/en/unit0/introduction.mdx b/units/en/unit0/introduction.mdx
index 84446b6..6262e31 100644
--- a/units/en/unit0/introduction.mdx
+++ b/units/en/unit0/introduction.mdx
@@ -52,21 +52,21 @@ The course is composed of:

 You can choose to follow this course either:

-- *To get a certificate of completion*: you need to complete 80% of the assignments before the end of April 2023.
-- *To get a certificate of honors*: you need to complete 100% of the assignments before the end of April 2023.
+- *To get a certificate of completion*: you need to complete 80% of the assignments before the end of June 2023.
+- *To get a certificate of honors*: you need to complete 100% of the assignments before the end of June 2023.
 - *As a simple audit*: you can participate in all challenges and do assignments if you want, but you have no deadlines.

 Both paths **are completely free**.
 Whatever path you choose, we advise you **to follow the recommended pace to enjoy the course and challenges with your fellow classmates.**

-You don't need to tell us which path you choose. At the end of April, when we will verify the assignments **if you get more than 80% of the assignments done, you'll get a certificate.**
+You don't need to tell us which path you choose. **If you get more than 80% of the assignments done, you'll get a certificate.**

 ## The Certification Process [[certification-process]]

 The certification process is **completely free**:

-- *To get a certificate of completion*: you need to complete 80% of the assignments before the end of April 2023.
-- *To get a certificate of honors*: you need to complete 100% of the assignments before the end of April 2023.
+- *To get a certificate of completion*: you need to complete 80% of the assignments before the end of June 2023.
+- *To get a certificate of honors*: you need to complete 100% of the assignments before the end of June 2023.

 Course certification