Abhishek Rana
763c7c0c97
Update hands-on.mdx
2024-05-01 16:59:33 +05:30
Thomas Simonini
732d543677
Merge branch 'main' into pierrecounathe/unit-4-propositions
2024-04-19 08:14:34 +02:00
S-N-O-R-L-A-X
ddcdc8cd3a
fix: fix gap between math signs and text
2024-04-07 16:10:57 +08:00
S-N-O-R-L-A-X
f123308a28
fix: fix doc equations
2024-04-04 19:33:25 +08:00
Thomas Simonini
72473f08a8
Update pg-theorem.mdx
2024-03-05 10:45:12 +01:00
Thomas Simonini
9d777a01b0
Update pg-theorem.mdx
2024-03-05 10:40:03 +01:00
Thomas Simonini
0e55db0106
Merge pull request #493 from S-N-O-R-L-A-X/patch-1
...
fix error in quiz2.mdx
2024-03-05 10:33:06 +01:00
Thomas Simonini
df5ffa3917
Merge pull request #499 from alexpalms/main
...
Add DIAMBRA Arena RL Environment
2024-03-04 17:44:12 +01:00
Thomas Simonini
bf5a72ad6c
Merge pull request #420 from fzyzcjy/patch-1
...
Super tiny fix format
2024-03-04 16:59:57 +01:00
Alessandro Palmas
382c69caa4
Update units/en/unitbonus3/envs-to-try.mdx
...
Co-authored-by: Thomas Simonini <simonini.thomas.pro@gmail.com>
2024-03-02 14:59:06 -05:00
Alessandro Palmas
e8b6db8a32
Update units/en/unitbonus3/envs-to-try.mdx
...
Co-authored-by: Thomas Simonini <simonini.thomas.pro@gmail.com>
2024-03-02 14:58:10 -05:00
Alessandro Palmas
2db3b14f4a
Update diambra arena image
2024-03-01 23:32:13 -05:00
Alessandro Palmas
cd30c90961
Updated page
2024-03-01 23:28:19 -05:00
Thomas Simonini
4b416005e7
Merge pull request #487 from PierreCounathe/pierrecounathe/unit-5-propositions
...
Unit 5 Proposal Updates
2024-03-01 15:57:44 +01:00
Thomas Simonini
1da2ef65ee
Merge pull request #489 from MrPuppeteer/patch-1
...
Fix typo in discord101.mdx
2024-03-01 15:55:40 +01:00
Thomas Simonini
1b09e7cbac
Merge pull request #490 from BalajiAI/patch-1
...
Update variance-problem.mdx
2024-03-01 15:53:29 +01:00
Thomas Simonini
00c6120fe6
Update hands-on.mdx
2024-02-26 10:10:56 +01:00
Thomas Simonini
0535a45230
Update hands-on.mdx
...
* Change pyramids environment
2024-02-26 09:59:54 +01:00
SNORLAX
5e5ea78e63
fix error in quiz2.mdx
2024-02-24 20:37:58 +08:00
Alessandro Palmas
f4e21ebc8d
Add some links
2024-02-23 00:10:43 -05:00
Alessandro Palmas
7bf227dea2
Add DIAMBRA to envs to try
2024-02-22 23:53:50 -05:00
Balaji Varatharajan
87fcfeb9bb
Update variance-problem.mdx
...
Hi, I've a blog titled [High Variance in Policy gradients](https://balajiai.github.io/high_variance_in_policy_gradients) which also explains about the variance problem in policy gradient and techniques for variance reduction such as baseline and actor-critics method.
I think, it would be valuable to this course readers. So I'm adding it to the reading-list.
Thanks!
2024-02-17 15:16:29 +05:30
Bagas N
62183fd456
Fix typo in discord101.mdx
2024-02-12 12:52:59 +07:00
Pierre Counathe
5d6a406589
nits
2024-02-09 19:32:40 -08:00
Pierre Counathe
33b97e99ec
proposal
2024-02-09 19:21:04 -08:00
Richard Khanh Manh Nguyen
da63862afe
Update hands-on.mdx
...
Fix typo
2024-01-28 00:57:15 -06:00
Thomas Simonini
193f63bd9c
Update hands-on.mdx
2024-01-24 10:19:28 +01:00
Thomas Simonini
605ce608cf
Merge pull request #466 from lutzvdb/patch-2
...
Update mid-way-recap.mdx
2024-01-24 10:09:36 +01:00
Thomas Simonini
fdf781ed82
Update train.mdx
2024-01-24 10:03:32 +01:00
Lutz von der Burchard
ca29ddfbf9
Update mid-way-recap.mdx
...
Compare issue 451 (https://github.com/huggingface/deep-rl-class/issues/451)
2024-01-15 09:45:50 +01:00
Thomas Simonini
32d5564236
Merge pull request #454 from lutzvdb/patch-1
...
Added clarification to the meaning of the rows of the Q-table
2024-01-15 09:42:01 +01:00
Thomas Simonini
6dc5937cbb
Merge pull request #444 from ashwinsnambiar/patch-1
...
Update train.mdx : this commit fixes #443
2024-01-15 09:40:56 +01:00
Eric Dong
bd8a3f87f2
Updated student works
2024-01-03 13:05:13 -06:00
Thomas Simonini
0bfa919876
Merge pull request #445 from varun-sappa/main
...
This PR solves issue #434
2024-01-02 10:12:27 +01:00
Thomas Simonini
cdb5982872
Merge pull request #448 from Ivan-267/patch-1
...
Small typo correction on the Godot-RL section
2024-01-02 10:11:15 +01:00
Thomas Simonini
21a717c70d
Merge pull request #455 from fzyzcjy/patch-2
...
Super tiny fix typo
2024-01-02 10:07:11 +01:00
Adam Molnar
c753f11238
Update setup.mdx
2024-01-01 09:52:06 +01:00
fzyzcjy
5a9389744e
Update visualize.mdx
2023-12-30 14:32:07 +08:00
Lutz von der Burchard
162110aba9
Added clarification to the meaning of the rows of the Q-table
2023-12-28 11:00:33 +01:00
Ivan-267
d3ab17f1aa
Update godotrl.mdx
2023-12-18 16:01:01 +01:00
Varun Sappa
a8dfa2bbd0
Update hands-on.mdx
2023-12-18 12:28:48 +05:30
Varun Sappa
4f61efab80
Update hands-on.mdx
2023-12-18 12:28:03 +05:30
ashwinsnambiar
ed1638df55
Update train.mdx
...
ML Agents parameter --run-id : misspelled with underscore instead of hyphen
2023-12-16 22:05:51 +01:00
Thomas Simonini
e7ecdffd41
Merge pull request #437 from fardinafdideh/unit3-deep-q-algorithm
...
unit3 | deep-q-algorithm | catastrophic forgetting
2023-12-12 09:29:00 +01:00
DESKTOP-AENDA0E\Fardin
8692616b7a
unit3 | deep-q-algorithm | catastrophic forgetting
2023-12-11 11:08:54 +01:00
Jose J. Martinez
494751c447
Unit 7 quiz
2023-12-07 16:54:34 +00:00
Jose J. Martinez
abd4a56c32
Unit 5 quiz and rewording of unit 6
2023-12-06 18:30:51 +00:00
Jose J. Martinez
4237fe92c4
Merge branch 'main' into unit-5-quiz
2023-12-06 18:00:37 +00:00
Juan Martinez
f41bf2c5fb
Fixes missing commas
2023-12-06 11:36:29 +00:00
Juan Martinez
f7c510a063
Adds newline after ###
2023-12-06 11:12:38 +00:00