Commit Graph

1044 Commits

Author SHA1 Message Date
Balaji Varatharajan
87fcfeb9bb Update variance-problem.mdx
Hi, I have a blog post titled [High Variance in Policy gradients](https://balajiai.github.io/high_variance_in_policy_gradients) that also explains the variance problem in policy gradients and variance-reduction techniques such as baselines and actor-critic methods.
I think it would be valuable to readers of this course, so I'm adding it to the reading list.

Thanks!
2024-02-17 15:16:29 +05:30
Thomas Simonini
6ab84a4e8e Using wget instead 2024-02-16 17:35:15 +01:00
Thomas Simonini
d7ec1b4ae3 Merge pull request #477 from RichardKhanhWin/unit1-typo-fix
Unit 1 typo fixes
2024-01-29 10:50:24 +01:00
Richard Khanh Manh Nguyen
108870e2c9 Update unit1.ipynb
Fix typo.
2024-01-28 01:03:44 -06:00
Richard Khanh Manh Nguyen
da63862afe Update hands-on.mdx
Fix typo
2024-01-28 00:57:15 -06:00
Thomas Simonini
04f2277489 Merge pull request #474 from huggingface/ThomasSimonini/Unit5Update
Update Unit 5
2024-01-24 10:21:46 +01:00
Thomas Simonini
193f63bd9c Update hands-on.mdx 2024-01-24 10:19:28 +01:00
Thomas Simonini
07ff43dd8d Update wget with GitHub link instead 2024-01-24 10:18:16 +01:00
Thomas Simonini
605ce608cf Merge pull request #466 from lutzvdb/patch-2
Update mid-way-recap.mdx
2024-01-24 10:09:36 +01:00
Thomas Simonini
4966807d7a Merge pull request #473 from huggingface/ThomasSimonini/HuggyUpdate
Update Huggy
2024-01-24 10:07:16 +01:00
Thomas Simonini
fdf781ed82 Update train.mdx 2024-01-24 10:03:32 +01:00
Thomas Simonini
dab558b310 Update wget with a Github Repository 2024-01-24 10:01:41 +01:00
Lutz von der Burchard
ca29ddfbf9 Update mid-way-recap.mdx
Compare issue 451 (https://github.com/huggingface/deep-rl-class/issues/451)
2024-01-15 09:45:50 +01:00
Thomas Simonini
32d5564236 Merge pull request #454 from lutzvdb/patch-1
Added clarification to the meaning of the rows of the Q-table
2024-01-15 09:42:01 +01:00
Thomas Simonini
6dc5937cbb Merge pull request #444 from ashwinsnambiar/patch-1
Update train.mdx : this commit fixes #443
2024-01-15 09:40:56 +01:00
Thomas Simonini
81a364eafd Update run-id 2024-01-15 09:34:21 +01:00
Thomas Simonini
e344e69f9d Merge pull request #459 from e-dong/edit-space-war-student-works
Updated links for Space War project
2024-01-15 09:22:55 +01:00
Eric Dong
bd8a3f87f2 Updated student works 2024-01-03 13:05:13 -06:00
Thomas Simonini
0bfa919876 Merge pull request #445 from varun-sappa/main
This PR solves issue #434
2024-01-02 10:12:27 +01:00
Thomas Simonini
cdb5982872 Merge pull request #448 from Ivan-267/patch-1
Small typo correction on the Godot-RL section
2024-01-02 10:11:15 +01:00
Thomas Simonini
21a717c70d Merge pull request #455 from fzyzcjy/patch-2
Super tiny fix typo
2024-01-02 10:07:11 +01:00
Thomas Simonini
b481d711c2 Merge pull request #458 from lunarflu/main
Update role assignment channel
2024-01-02 09:56:49 +01:00
Adam Molnar
c753f11238 Update setup.mdx 2024-01-01 09:52:06 +01:00
fzyzcjy
5a9389744e Update visualize.mdx 2023-12-30 14:32:07 +08:00
Lutz von der Burchard
162110aba9 Added clarification to the meaning of the rows of the Q-table 2023-12-28 11:00:33 +01:00
Ivan-267
d3ab17f1aa Update godotrl.mdx 2023-12-18 16:01:01 +01:00
Varun Sappa
a8dfa2bbd0 Update hands-on.mdx 2023-12-18 12:28:48 +05:30
Varun Sappa
4f61efab80 Update hands-on.mdx 2023-12-18 12:28:03 +05:30
ashwinsnambiar
ed1638df55 Update train.mdx
ML Agents parameter --run-id: misspelled with an underscore instead of a hyphen
2023-12-16 22:05:51 +01:00
Thomas Simonini
e7ecdffd41 Merge pull request #437 from fardinafdideh/unit3-deep-q-algorithm
unit3 | deep-q-algorithm | catastrophic forgetting
2023-12-12 09:29:00 +01:00
Thomas Simonini
aef9bdb042 Merge pull request #431 from josejuanmartinez/unit-7-quiz
Unit 7 quiz
2023-12-11 18:13:55 +01:00
DESKTOP-AENDA0E\Fardin
8692616b7a unit3 | deep-q-algorithm | catastrophic forgetting 2023-12-11 11:08:54 +01:00
Jose J. Martinez
494751c447 Unit 7 quiz 2023-12-07 16:54:34 +00:00
Thomas Simonini
21df13744c Merge pull request #430 from josejuanmartinez/unit-5-quiz
Unit 5 quiz and some rewording for Unit 6
2023-12-07 14:25:46 +01:00
Jose J. Martinez
abd4a56c32 Unit 5 quiz and rewording of unit 6 2023-12-06 18:30:51 +00:00
Jose J. Martinez
4237fe92c4 Merge branch 'main' into unit-5-quiz 2023-12-06 18:00:37 +00:00
Juan Martinez
dd879809f6 Merge branch 'huggingface:main' into main 2023-12-06 17:59:29 +00:00
Thomas Simonini
64e049c36a Merge pull request #429 from josejuanmartinez/unit-6-quiz
Creates quiz for unit 6
2023-12-06 12:45:12 +01:00
Juan Martinez
f41bf2c5fb Fixes missing commas 2023-12-06 11:36:29 +00:00
Juan Martinez
f7c510a063 Adds newline after ### 2023-12-06 11:12:38 +00:00
Juan Martinez
40cf7684e5 Fixes typo and comma(s) 2023-12-06 11:10:43 +00:00
Juan Martinez
dbd0d00000 Update _toctree.yml 2023-12-04 17:29:45 +00:00
Juan Martinez
cc89254cf5 Create quiz for unit 5 2023-12-04 17:29:05 +00:00
Juan Martinez
9614d3d51b Delete units/en/unit5/quiz.mdx 2023-12-04 17:28:43 +00:00
Juan Martinez
a5d8d6badb Create quiz for unit 5 2023-12-04 17:27:47 +00:00
Juan Martinez
306d4084c2 Update _toctree.yml 2023-12-04 16:32:22 +00:00
Juan Martinez
57678da563 Update quiz.mdx 2023-12-03 19:16:18 +00:00
Juan Martinez
a31043822e Create quiz for unit 6 2023-12-03 18:58:37 +00:00
Thomas Simonini
93c5115ed6 Merge pull request #424 from huggingface/ThomasSimonini/StudentProjects
Add SpaceScavangerAI
2023-11-22 18:00:07 +01:00
Thomas Simonini
d7ead9c94c Update student-works.mdx 2023-11-22 17:59:55 +01:00