Update README.md

2026-06-16 23:18:59 +08:00 · 2022-04-21 15:24:29 +02:00
parent 044f4852e9
commit 1bef92bc6f
1 changed files with 43 additions and 29 deletions
--- a/README.md
+++ b/README.md
@@ -1,54 +1,68 @@
-# The Hugging Face Deep Reinforcement Learning Bootcamp 🤗
+# The Hugging Face Deep Reinforcement Learning Class 🤗

-In this free bootcamp, you will:
+In this free course, you will:

 - 📖 Study Deep Reinforcement Learning in **theory and practice**.
 - 🧑‍💻 Learn to **use famous Deep RL libraries** such as Stable Baselines3, RL Baselines3 Zoo, and RLlib.
 - 🤖 Train agents in **unique environments** such as SnowballFight, Huggy the Doggo 🐶, and classical ones such as Space Invaders and PyBullet.
- 💾 **Publish your trained agents in one line of code to the Hub**. But also **download powerful agents from the community**.
+- 💾 **Publish your trained agents in one line of code to the Hugging Face Hub**. But also **download powerful agents from the community**.
+- 🏆 **Participate in challenges** where you going to evaluate your agents against other teams.

 ➡️➡️➡️ Don't forget to sign up here: https://forms.gle/4bbgzs3oVZMjgDed7

 The best way to keep in touch is to **join our discord server to exchange with the community and with us** 👉🏻 https://discord.gg/aYka4Yhff9

-And don't forget to share with your friends who want to learn 🤗 !
+And don't forget to share with your friends who want to learn 🤗!

-## The Syllabus
+## The Syllabus 🏗️

-# Unit 1: An Introduction to Deep Reinforcement Learning
-  - Theory: An Introduction to Deep Reinforcement Learning
-  - Hands-on: Train your first agent with SB3
-
-# Bonus: Train Huggy the Doggo
-
-# Unit 2: Q-Learning
-
-# Unit 3: Deep Q-Learning and improvements
+| May, the 4th  | An Introduction to Deep Reinforcement Learning           | Train a Deep Reinforcement Learning lander agent to land correctly on the Moon 🌕 using Stable-Baselines3 |
+|---------------|----------------------------------------------------------|----------------------------------------------------------------------------------------------------------|
+| May, the 11th | Bonus                                                    | Train Huggy the Doggo to catch the stick                                                                 |
+| May, the 18th | Q-Learning                                               | Train an agent to cross a Frozen lake in this new version of the environment.                                                                                                        |
+| June, the 1st | Deep Q-Learning and improvements                         | Train a Deep Q-Learning agent to play Space Invaders                                                     |
+|               | Policy based methods                                     | 🏗️                                                                                                        |
+|               | Actor Critic Methods                                     | 🏗️                                                                                                        |
+|               | Proximal Policy Optimization (PPO)                       | 🏗️                                                                                                        |
+|               | Decision Transformers and offline Reinforcement Learning | 🏗️                                                                                                        |
+|               | Towards better explorations methods                      | 🏗️                                                                                                        |


-# Unit 4: Policy based methods
+## The library you'll learn during this course
+- [Stable-Baselines3](https://github.com/DLR-RM/stable-baselines3)
+- [RL Baselines3 Zoo](https://github.com/DLR-RM/rl-baselines3-zoo)
+- [RLlib](https://docs.ray.io/en/latest/rllib/index.html)
+- [CleanRL](https://github.com/vwxyzjn/cleanrl)
+- More to come 🏗️
+
+## The Environments you'll use
+### Custom environments made by Hugging Face Team using Unity and Godot
+- Huggy the Doggo 🐶
+(Huggy the Doggo is based on [Unity's Puppo the Corgi work](https://blog.unity.com/technology/puppo-the-corgi-cuteness-overload-with-the-unity-ml-agents-toolkit))


-# Unit 5: Actor Critic Methods
+
+- SnowballFight ☃️
+[snowballfight.gif](./assets/img/snowballfight.gif)
+👉 Play it here : https://huggingface.co/spaces/ThomasSimonini/SnowballFight


-# Unit 6: Proximal Policy Optimization (PPO)
+- More to come


-# Unit 7: Decision Transformers 
+# Gym classic controls environments
+- Lunar-Lander v2:


- Introduction to Deep Reinforcement Learning
-    - Theory: C1 DRLC Introduction to Deep RL
-    - Practice: Lunar Lander
-    - Library: SB3
-    - Bonus: Huggy

- Value based methods: Q-Learning
-    - Theory: Q-Learning
-    - Practice: Frozen lake updated version
+# PyBullet
+
+
+# Gym Atari environments
+
+
+# MLAgents environments
+
+

- Deep Q-Learning and improvements
-    - Theory: Deep Q-Learning and DDQN
-    - Practice: