Thomas Simonini
fb3cb0cc72
Merge pull request #522 from AbhishekRana21/main
...
a small grammatical error fix
2024-05-02 09:01:07 +02:00
Abhishek Rana
763c7c0c97
Update hands-on.mdx
2024-05-01 16:59:33 +05:30
Abhishek Rana
bc3493e3ac
Update unit1.ipynb
2024-05-01 16:55:25 +05:30
Thomas Simonini
ebfd6d5470
Merge pull request #488 from PierreCounathe/pierrecounathe/unit-4-propositions
...
Unit 4 Proposal Updates
2024-04-19 08:23:04 +02:00
Thomas Simonini
732d543677
Merge branch 'main' into pierrecounathe/unit-4-propositions
2024-04-19 08:14:34 +02:00
Thomas Simonini
e9f1aff33f
Merge pull request #513 from S-N-O-R-L-A-X/main
...
Fix gap between text and math signs
2024-04-18 08:46:20 +02:00
S-N-O-R-L-A-X
ddcdc8cd3a
fix: fix gap between math signs and text
2024-04-07 16:10:57 +08:00
S-N-O-R-L-A-X
f123308a28
fix: fix doc equations
2024-04-04 19:33:25 +08:00
Thomas Simonini
c929ba2e6e
Merge pull request #503 from huggingface/ThomasSimonini/UpdateUnit4
...
Update pg-theorem.mdx
2024-03-05 10:49:34 +01:00
Thomas Simonini
72473f08a8
Update pg-theorem.mdx
2024-03-05 10:45:12 +01:00
Thomas Simonini
9d777a01b0
Update pg-theorem.mdx
2024-03-05 10:40:03 +01:00
Thomas Simonini
0e55db0106
Merge pull request #493 from S-N-O-R-L-A-X/patch-1
...
fix error in quiz2.mdx
2024-03-05 10:33:06 +01:00
Thomas Simonini
df5ffa3917
Merge pull request #499 from alexpalms/main
...
Add DIAMBRA Arena RL Environment
2024-03-04 17:44:12 +01:00
Thomas Simonini
3dfa91c052
Merge pull request #500 from Croolch/unit1ipynb-fix
...
Update unit1.ipynb
2024-03-04 17:04:01 +01:00
Thomas Simonini
bf5a72ad6c
Merge pull request #420 from fzyzcjy/patch-1
...
Super tiny fix format
2024-03-04 16:59:57 +01:00
Alessandro Palmas
382c69caa4
Update units/en/unitbonus3/envs-to-try.mdx
...
Co-authored-by: Thomas Simonini <simonini.thomas.pro@gmail.com >
2024-03-02 14:59:06 -05:00
Alessandro Palmas
e8b6db8a32
Update units/en/unitbonus3/envs-to-try.mdx
...
Co-authored-by: Thomas Simonini <simonini.thomas.pro@gmail.com >
2024-03-02 14:58:10 -05:00
Ivan
311e125d06
Update unit1.ipynb
2024-03-02 14:13:20 +08:00
Alessandro Palmas
2db3b14f4a
Update diambra arena image
2024-03-01 23:32:13 -05:00
Alessandro Palmas
cd30c90961
Updated page
2024-03-01 23:28:19 -05:00
Thomas Simonini
262cc0c608
Merge pull request #498 from huggingface/simoninithomas-patch-1
...
Delete .github/workflows/delete_doc_comment_trigger.yml
2024-03-01 16:22:49 +01:00
Thomas Simonini
c43bad97b7
Delete .github/workflows/delete_doc_comment_trigger.yml
2024-03-01 16:22:39 +01:00
Thomas Simonini
dbc300003c
Merge pull request #497 from huggingface/DeleteActions
...
Delete actions
2024-03-01 16:20:03 +01:00
Thomas Simonini
74ce0e458f
Delete .github/workflows/delete_doc_comment.yml
2024-03-01 16:19:10 +01:00
Thomas Simonini
4b416005e7
Merge pull request #487 from PierreCounathe/pierrecounathe/unit-5-propositions
...
Unit 5 Proposal Updates
2024-03-01 15:57:44 +01:00
Thomas Simonini
1da2ef65ee
Merge pull request #489 from MrPuppeteer/patch-1
...
Fix typo in discord101.mdx
2024-03-01 15:55:40 +01:00
Thomas Simonini
1b09e7cbac
Merge pull request #490 from BalajiAI/patch-1
...
Update variance-problem.mdx
2024-03-01 15:53:29 +01:00
Thomas Simonini
c8e9422622
Merge pull request #494 from huggingface/ThomasSimonini/Unit5Update
...
Unit 5: Update Hands on
2024-02-26 10:11:09 +01:00
Thomas Simonini
00c6120fe6
Update hands-on.mdx
2024-02-26 10:10:56 +01:00
Thomas Simonini
8cde698f60
Update unit5.ipynb
2024-02-26 10:02:14 +01:00
Thomas Simonini
0535a45230
Update hands-on.mdx
...
* Change pyramids environment
2024-02-26 09:59:54 +01:00
Thomas Simonini
89f68ae039
Update Pyramids download
2024-02-26 09:57:41 +01:00
SNORLAX
5e5ea78e63
fix error in quiz2.mdx
2024-02-24 20:37:58 +08:00
Alessandro Palmas
f4e21ebc8d
Add some links
2024-02-23 00:10:43 -05:00
Alessandro Palmas
7bf227dea2
Add DIAMBRA to envs to try
2024-02-22 23:53:50 -05:00
Balaji Varatharajan
87fcfeb9bb
Update variance-problem.mdx
...
Hi, I've a blog titled [High Variance in Policy gradients](https://balajiai.github.io/high_variance_in_policy_gradients ) which also explains about the variance problem in policy gradient and techniques for variance reduction such as baseline and actor-critics method.
I think, it would be valuable to this course readers. So I'm adding it to the reading-list.
Thanks!
2024-02-17 15:16:29 +05:30
Thomas Simonini
6ab84a4e8e
Using wget instead
2024-02-16 17:35:15 +01:00
Bagas N
62183fd456
Fix typo in discord101.mdx
2024-02-12 12:52:59 +07:00
Pierre Counathe
5d6a406589
nits
2024-02-09 19:32:40 -08:00
Pierre Counathe
33b97e99ec
proposal
2024-02-09 19:21:04 -08:00
Thomas Simonini
d7ec1b4ae3
Merge pull request #477 from RichardKhanhWin/unit1-typo-fix
...
Unit 1 typo fixes
2024-01-29 10:50:24 +01:00
Richard Khanh Manh Nguyen
108870e2c9
Update unit1.ipynb
...
Fix typo.
2024-01-28 01:03:44 -06:00
Richard Khanh Manh Nguyen
da63862afe
Update hands-on.mdx
...
Fix typo
2024-01-28 00:57:15 -06:00
Thomas Simonini
04f2277489
Merge pull request #474 from huggingface/ThomasSimonini/Unit5Update
...
Update Unit 5
2024-01-24 10:21:46 +01:00
Thomas Simonini
193f63bd9c
Update hands-on.mdx
2024-01-24 10:19:28 +01:00
Thomas Simonini
07ff43dd8d
Update wget with GitHub link instead
2024-01-24 10:18:16 +01:00
Thomas Simonini
605ce608cf
Merge pull request #466 from lutzvdb/patch-2
...
Update mid-way-recap.mdx
2024-01-24 10:09:36 +01:00
Thomas Simonini
4966807d7a
Merge pull request #473 from huggingface/ThomasSimonini/HuggyUpdate
...
Update Huggy
2024-01-24 10:07:16 +01:00
Thomas Simonini
fdf781ed82
Update train.mdx
2024-01-24 10:03:32 +01:00
Thomas Simonini
dab558b310
Update wget with a Github Repository
2024-01-24 10:01:41 +01:00