Commit Graph

487 Commits

Author SHA1 Message Date
Thomas Simonini
d406e5bb08 Créé avec Colaboratory 2023-01-17 07:34:49 +01:00
Thomas Simonini
368b54970f Update _toctree.yml 2023-01-17 07:34:01 +01:00
Thomas Simonini
196b80e15b Update requirements-unit6.txt 2023-01-17 07:16:47 +01:00
Thomas Simonini
2a35c66ec5 Apply suggestions from code review
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>
2023-01-16 18:08:36 +01:00
Thomas Simonini
be7f8a34f0 Update notebook 2023-01-02 12:44:57 +01:00
Thomas Simonini
f937f8c7db Update introduction.mdx 2023-01-02 10:26:55 +01:00
Thomas Simonini
1680476a04 Add unit6 WIP 2023-01-01 17:30:34 +01:00
simoninithomas
14bd94d574 Update conclusion 2023-01-01 17:29:07 +01:00
Thomas Simonini
b835b898fc Update conclusion.mdx 2022-12-31 20:36:44 +01:00
Thomas Simonini
be34a485d0 Update requirements-unit6.txt 2022-12-31 11:19:23 +01:00
Thomas Simonini
d733a98e39 Update requirements-unit6.txt 2022-12-31 11:17:49 +01:00
Thomas Simonini
526d5fd48c Create requirements-unit6.txt 2022-12-31 11:13:23 +01:00
simoninithomas
143f169a65 Adding reading resources 2022-12-30 19:05:40 +01:00
simoninithomas
5aeaf3b5c4 Adding updated A2C Unit 2022-12-30 19:01:28 +01:00
Thomas Simonini
935b712cda Merge pull request #151 from huggingface/ThomasSimonini/SmallUpdates
Small updates typos and others
2022-12-20 14:21:38 +01:00
Thomas Simonini
7b61d9f813 Update bellman-equation.mdx 2022-12-20 14:20:40 +01:00
Thomas Simonini
5f66e67419 Update mc-vs-td.mdx 2022-12-20 14:06:10 +01:00
Thomas Simonini
3bdc44cd35 Update bellman-equation.mdx 2022-12-20 14:05:29 +01:00
Thomas Simonini
beaef9b0a4 Update two-types-value-based-methods.mdx 2022-12-20 14:02:46 +01:00
Thomas Simonini
31dc00a52b Update additional-readings.mdx
Add make your own gym custom env
2022-12-20 13:59:12 +01:00
Thomas Simonini
630b80a00f Update hands-on.mdx 2022-12-20 13:54:08 +01:00
Thomas Simonini
74b724c7a7 Update links 2022-12-20 13:44:59 +01:00
Thomas Simonini
093bdb1ed8 Merge pull request #137 from ramon-rd/patch-1
Create glossary.mdx
2022-12-20 13:07:07 +01:00
Thomas Simonini
a37804cebf Update glossary.mdx 2022-12-20 13:06:31 +01:00
Thomas Simonini
c275b13ddf Update _toctree.yml 2022-12-20 13:04:35 +01:00
Thomas Simonini
f354d80e8c Merge pull request #108 from huggingface/ThomasSimonini/Unit3
Adding Unit 3: Deep Q-Learning and Optuna Bonus
2022-12-19 16:03:11 +01:00
Thomas Simonini
fd710896cf Update hands-on.mdx
- Add cleanrl link
- Some cleanups
2022-12-19 15:05:38 +01:00
Thomas Simonini
5aa9e72390 Add advice about saving the colab 2022-12-19 14:00:37 +01:00
Thomas Simonini
96b49481da Update hands-on.mdx 2022-12-19 13:53:38 +01:00
Thomas Simonini
bcbc168d6b Update colab 2022-12-19 12:44:34 +01:00
Thomas Simonini
33c02be800 Update hands-on.mdx 2022-12-19 12:44:13 +01:00
Thomas Simonini
d500baac63 Apply suggestions from code review
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>
2022-12-19 12:18:26 +01:00
Omar Sanseviero
ae2550331a Fixing Unit 1 Fig reference
Fixing Unit 1 Fig reference
2022-12-18 22:26:27 +01:00
Andrey Voroshilov
a4cd53cd37 Fixing the reference, a) to the right Berkeley course (CS 188 and not CS 285) and b) to have a proper URL format 2022-12-18 12:24:23 -08:00
Thomas Simonini
e8d9c18482 Merge pull request #138 from Artachtron/main
Format corrections
2022-12-18 18:10:40 +01:00
Artagon
fc66ea7e4a Rephrasing for initial epsilon value 2022-12-17 22:33:02 +01:00
Artagon
96714cdb10 Cases consistency 2022-12-17 22:23:08 +01:00
Thomas Simonini
f0cd89b4b5 Merge pull request #141 from c1505/patch-1
Fixed typo.  `pip install` was written twice
2022-12-17 18:43:33 +01:00
Corey
f09570b79e Fixed typo. pip install was written twice 2022-12-17 12:02:01 -05:00
Artagon
a7d74befb0 Fix midsentence uppercase 'Policy' 2022-12-17 14:47:18 +01:00
Artagon
753ef67eae epsilon-greedy instead of epsilon greedy 2022-12-17 14:45:08 +01:00
Artagon
f913af7300 epsilon smaller or equal to 1.0 2022-12-17 14:39:40 +01:00
Artagon
0a4c6c6f2c fix redundant 'pair' and inconsistent Case. 2022-12-17 14:30:19 +01:00
Artagon
0c3616c03f Replace ** by <b> tags in figcaption 2022-12-16 20:34:24 +01:00
Artagon
0744d542ad Properly display π* 2022-12-16 20:31:49 +01:00
Thomas Simonini
1abf623feb Update notebook 2022-12-16 10:05:42 +01:00
Thomas Simonini
38d0b2c73a Merge branch 'main' into ThomasSimonini/Unit3 2022-12-16 10:04:21 +01:00
simoninithomas
e442f0832b Update Unit 3 2022-12-16 10:02:52 +01:00
simoninithomas
8ae92a3029 Merge branch 'ThomasSimonini/Unit3' of https://github.com/huggingface/deep-rl-class into ThomasSimonini/Unit3
Update the branch
2022-12-16 09:46:32 +01:00
simoninithomas
ed065ac128 Update Unit 3 and 4 2022-12-16 09:44:59 +01:00