Balaji Varatharajan
87fcfeb9bb
Update variance-problem.mdx
...
Hi, I've a blog titled [High Variance in Policy gradients](https://balajiai.github.io/high_variance_in_policy_gradients ) which also explains about the variance problem in policy gradient and techniques for variance reduction such as baseline and actor-critics method.
I think, it would be valuable to this course readers. So I'm adding it to the reading-list.
Thanks!
2024-02-17 15:16:29 +05:30
Thomas Simonini
6ab84a4e8e
Using wget instead
2024-02-16 17:35:15 +01:00
Thomas Simonini
d7ec1b4ae3
Merge pull request #477 from RichardKhanhWin/unit1-typo-fix
...
Unit 1 typo fixes
2024-01-29 10:50:24 +01:00
Richard Khanh Manh Nguyen
108870e2c9
Update unit1.ipynb
...
Fix typo.
2024-01-28 01:03:44 -06:00
Richard Khanh Manh Nguyen
da63862afe
Update hands-on.mdx
...
Fix typo
2024-01-28 00:57:15 -06:00
Thomas Simonini
04f2277489
Merge pull request #474 from huggingface/ThomasSimonini/Unit5Update
...
Update Unit 5
2024-01-24 10:21:46 +01:00
Thomas Simonini
193f63bd9c
Update hands-on.mdx
2024-01-24 10:19:28 +01:00
Thomas Simonini
07ff43dd8d
Update wget with GitHub link instead
2024-01-24 10:18:16 +01:00
Thomas Simonini
605ce608cf
Merge pull request #466 from lutzvdb/patch-2
...
Update mid-way-recap.mdx
2024-01-24 10:09:36 +01:00
Thomas Simonini
4966807d7a
Merge pull request #473 from huggingface/ThomasSimonini/HuggyUpdate
...
Update Huggy
2024-01-24 10:07:16 +01:00
Thomas Simonini
fdf781ed82
Update train.mdx
2024-01-24 10:03:32 +01:00
Thomas Simonini
dab558b310
Update wget with a Github Repository
2024-01-24 10:01:41 +01:00
Lutz von der Burchard
ca29ddfbf9
Update mid-way-recap.mdx
...
Compare issue 451 (https://github.com/huggingface/deep-rl-class/issues/451 )
2024-01-15 09:45:50 +01:00
Thomas Simonini
32d5564236
Merge pull request #454 from lutzvdb/patch-1
...
Added clarification to the meaning of the rows of the Q-table
2024-01-15 09:42:01 +01:00
Thomas Simonini
6dc5937cbb
Merge pull request #444 from ashwinsnambiar/patch-1
...
Update train.mdx : this commit fixes #443
2024-01-15 09:40:56 +01:00
Thomas Simonini
81a364eafd
Update run-id
2024-01-15 09:34:21 +01:00
Thomas Simonini
e344e69f9d
Merge pull request #459 from e-dong/edit-space-war-student-works
...
Updated links for Space War project
2024-01-15 09:22:55 +01:00
Eric Dong
bd8a3f87f2
Updated student works
2024-01-03 13:05:13 -06:00
Thomas Simonini
0bfa919876
Merge pull request #445 from varun-sappa/main
...
This PR solves issue #434
2024-01-02 10:12:27 +01:00
Thomas Simonini
cdb5982872
Merge pull request #448 from Ivan-267/patch-1
...
Small typo correction on the Godot-RL section
2024-01-02 10:11:15 +01:00
Thomas Simonini
21a717c70d
Merge pull request #455 from fzyzcjy/patch-2
...
Super tiny fix typo
2024-01-02 10:07:11 +01:00
Thomas Simonini
b481d711c2
Merge pull request #458 from lunarflu/main
...
Update role assignment channel
2024-01-02 09:56:49 +01:00
Adam Molnar
c753f11238
Update setup.mdx
2024-01-01 09:52:06 +01:00
fzyzcjy
5a9389744e
Update visualize.mdx
2023-12-30 14:32:07 +08:00
Lutz von der Burchard
162110aba9
Added clarification to the meaning of the rows of the Q-table
2023-12-28 11:00:33 +01:00
Ivan-267
d3ab17f1aa
Update godotrl.mdx
2023-12-18 16:01:01 +01:00
Varun Sappa
a8dfa2bbd0
Update hands-on.mdx
2023-12-18 12:28:48 +05:30
Varun Sappa
4f61efab80
Update hands-on.mdx
2023-12-18 12:28:03 +05:30
ashwinsnambiar
ed1638df55
Update train.mdx
...
ML Agents parameter --run-id : misspelled with underscore instead of hypen
2023-12-16 22:05:51 +01:00
Thomas Simonini
e7ecdffd41
Merge pull request #437 from fardinafdideh/unit3-deep-q-algorithm
...
unit3 | deep-q-algorithm | catastrophic forgetting
2023-12-12 09:29:00 +01:00
Thomas Simonini
aef9bdb042
Merge pull request #431 from josejuanmartinez/unit-7-quiz
...
Unit 7 quiz
2023-12-11 18:13:55 +01:00
DESKTOP-AENDA0E\Fardin
8692616b7a
unit3 | deep-q-algorithm | catastrophic forgetting
2023-12-11 11:08:54 +01:00
Jose J. Martinez
494751c447
Unit 7 quiz
2023-12-07 16:54:34 +00:00
Thomas Simonini
21df13744c
Merge pull request #430 from josejuanmartinez/unit-5-quiz
...
Unit 5 quiz and some rewording for Unit 6
2023-12-07 14:25:46 +01:00
Jose J. Martinez
abd4a56c32
Unit 5 quiz and rewording of unit 6
2023-12-06 18:30:51 +00:00
Jose J. Martinez
4237fe92c4
Merge branch 'main' into unit-5-quiz
2023-12-06 18:00:37 +00:00
Juan Martinez
dd879809f6
Merge branch 'huggingface:main' into main
2023-12-06 17:59:29 +00:00
Thomas Simonini
64e049c36a
Merge pull request #429 from josejuanmartinez/unit-6-quiz
...
Creates quiz for unit 6
2023-12-06 12:45:12 +01:00
Juan Martinez
f41bf2c5fb
Fixes missing commas
2023-12-06 11:36:29 +00:00
Juan Martinez
f7c510a063
Adds newline after ###
2023-12-06 11:12:38 +00:00
Juan Martinez
40cf7684e5
Fixes typo and comma(s)
2023-12-06 11:10:43 +00:00
Juan Martinez
dbd0d00000
Update _toctree.yml
2023-12-04 17:29:45 +00:00
Juan Martinez
cc89254cf5
Create quiz for unit 5
2023-12-04 17:29:05 +00:00
Juan Martinez
9614d3d51b
Delete units/en/unit5/quiz.mdx
2023-12-04 17:28:43 +00:00
Juan Martinez
a5d8d6badb
Create quiz for unit 5
2023-12-04 17:27:47 +00:00
Juan Martinez
306d4084c2
Update _toctree.yml
2023-12-04 16:32:22 +00:00
Juan Martinez
57678da563
Update quiz.mdx
2023-12-03 19:16:18 +00:00
Juan Martinez
a31043822e
Create quiz for unit 6
2023-12-03 18:58:37 +00:00
Thomas Simonini
93c5115ed6
Merge pull request #424 from huggingface/ThomasSimonini/StudentProjects
...
Add SpaceScavangerAI
2023-11-22 18:00:07 +01:00
Thomas Simonini
d7ead9c94c
Update student-works.mdx
2023-11-22 17:59:55 +01:00