Commit Graph

1083 Commits

Author SHA1 Message Date
Thomas Simonini
fb3cb0cc72 Merge pull request #522 from AbhishekRana21/main
a small grammatical error fix
2024-05-02 09:01:07 +02:00
Abhishek Rana
763c7c0c97 Update hands-on.mdx 2024-05-01 16:59:33 +05:30
Abhishek Rana
bc3493e3ac Update unit1.ipynb 2024-05-01 16:55:25 +05:30
Thomas Simonini
ebfd6d5470 Merge pull request #488 from PierreCounathe/pierrecounathe/unit-4-propositions
Unit 4 Proposal Updates
2024-04-19 08:23:04 +02:00
Thomas Simonini
732d543677 Merge branch 'main' into pierrecounathe/unit-4-propositions 2024-04-19 08:14:34 +02:00
Thomas Simonini
e9f1aff33f Merge pull request #513 from S-N-O-R-L-A-X/main
Fix gap between text and math signs
2024-04-18 08:46:20 +02:00
S-N-O-R-L-A-X
ddcdc8cd3a fix: fix gap between math signs and text 2024-04-07 16:10:57 +08:00
S-N-O-R-L-A-X
f123308a28 fix: fix doc equations 2024-04-04 19:33:25 +08:00
Thomas Simonini
c929ba2e6e Merge pull request #503 from huggingface/ThomasSimonini/UpdateUnit4
Update pg-theorem.mdx
2024-03-05 10:49:34 +01:00
Thomas Simonini
72473f08a8 Update pg-theorem.mdx 2024-03-05 10:45:12 +01:00
Thomas Simonini
9d777a01b0 Update pg-theorem.mdx 2024-03-05 10:40:03 +01:00
Thomas Simonini
0e55db0106 Merge pull request #493 from S-N-O-R-L-A-X/patch-1
fix error in quiz2.mdx
2024-03-05 10:33:06 +01:00
Thomas Simonini
df5ffa3917 Merge pull request #499 from alexpalms/main
Add DIAMBRA Arena RL Environment
2024-03-04 17:44:12 +01:00
Thomas Simonini
3dfa91c052 Merge pull request #500 from Croolch/unit1ipynb-fix
Update unit1.ipynb
2024-03-04 17:04:01 +01:00
Thomas Simonini
bf5a72ad6c Merge pull request #420 from fzyzcjy/patch-1
Super tiny fix format
2024-03-04 16:59:57 +01:00
Alessandro Palmas
382c69caa4 Update units/en/unitbonus3/envs-to-try.mdx
Co-authored-by: Thomas Simonini <simonini.thomas.pro@gmail.com>
2024-03-02 14:59:06 -05:00
Alessandro Palmas
e8b6db8a32 Update units/en/unitbonus3/envs-to-try.mdx
Co-authored-by: Thomas Simonini <simonini.thomas.pro@gmail.com>
2024-03-02 14:58:10 -05:00
Ivan
311e125d06 Update unit1.ipynb 2024-03-02 14:13:20 +08:00
Alessandro Palmas
2db3b14f4a Update diambra arena image 2024-03-01 23:32:13 -05:00
Alessandro Palmas
cd30c90961 Updated page 2024-03-01 23:28:19 -05:00
Thomas Simonini
262cc0c608 Merge pull request #498 from huggingface/simoninithomas-patch-1
Delete .github/workflows/delete_doc_comment_trigger.yml
2024-03-01 16:22:49 +01:00
Thomas Simonini
c43bad97b7 Delete .github/workflows/delete_doc_comment_trigger.yml 2024-03-01 16:22:39 +01:00
Thomas Simonini
dbc300003c Merge pull request #497 from huggingface/DeleteActions
Delete actions
2024-03-01 16:20:03 +01:00
Thomas Simonini
74ce0e458f Delete .github/workflows/delete_doc_comment.yml 2024-03-01 16:19:10 +01:00
Thomas Simonini
4b416005e7 Merge pull request #487 from PierreCounathe/pierrecounathe/unit-5-propositions
Unit 5 Proposal Updates
2024-03-01 15:57:44 +01:00
Thomas Simonini
1da2ef65ee Merge pull request #489 from MrPuppeteer/patch-1
Fix typo in discord101.mdx
2024-03-01 15:55:40 +01:00
Thomas Simonini
1b09e7cbac Merge pull request #490 from BalajiAI/patch-1
Update variance-problem.mdx
2024-03-01 15:53:29 +01:00
Thomas Simonini
c8e9422622 Merge pull request #494 from huggingface/ThomasSimonini/Unit5Update
Unit 5: Update Hands on
2024-02-26 10:11:09 +01:00
Thomas Simonini
00c6120fe6 Update hands-on.mdx 2024-02-26 10:10:56 +01:00
Thomas Simonini
8cde698f60 Update unit5.ipynb 2024-02-26 10:02:14 +01:00
Thomas Simonini
0535a45230 Update hands-on.mdx
* Change pyramids environment
2024-02-26 09:59:54 +01:00
Thomas Simonini
89f68ae039 Update Pyramids download 2024-02-26 09:57:41 +01:00
SNORLAX
5e5ea78e63 fix error in quiz2.mdx 2024-02-24 20:37:58 +08:00
Alessandro Palmas
f4e21ebc8d Add some links 2024-02-23 00:10:43 -05:00
Alessandro Palmas
7bf227dea2 Add DIAMBRA to envs to try 2024-02-22 23:53:50 -05:00
Balaji Varatharajan
87fcfeb9bb Update variance-problem.mdx
Hi, I've a blog titled [High Variance in Policy gradients](https://balajiai.github.io/high_variance_in_policy_gradients) which also explains about the variance problem in policy gradient and techniques for variance reduction such as baseline and actor-critics method.
I think, it would be valuable to this course readers. So I'm adding it to the reading-list.

Thanks!
2024-02-17 15:16:29 +05:30
Thomas Simonini
6ab84a4e8e Using wget instead 2024-02-16 17:35:15 +01:00
Bagas N
62183fd456 Fix typo in discord101.mdx 2024-02-12 12:52:59 +07:00
Pierre Counathe
5d6a406589 nits 2024-02-09 19:32:40 -08:00
Pierre Counathe
33b97e99ec proposal 2024-02-09 19:21:04 -08:00
Thomas Simonini
d7ec1b4ae3 Merge pull request #477 from RichardKhanhWin/unit1-typo-fix
Unit 1 typo fixes
2024-01-29 10:50:24 +01:00
Richard Khanh Manh Nguyen
108870e2c9 Update unit1.ipynb
Fix typo.
2024-01-28 01:03:44 -06:00
Richard Khanh Manh Nguyen
da63862afe Update hands-on.mdx
Fix typo
2024-01-28 00:57:15 -06:00
Thomas Simonini
04f2277489 Merge pull request #474 from huggingface/ThomasSimonini/Unit5Update
Update Unit 5
2024-01-24 10:21:46 +01:00
Thomas Simonini
193f63bd9c Update hands-on.mdx 2024-01-24 10:19:28 +01:00
Thomas Simonini
07ff43dd8d Update wget with GitHub link instead 2024-01-24 10:18:16 +01:00
Thomas Simonini
605ce608cf Merge pull request #466 from lutzvdb/patch-2
Update mid-way-recap.mdx
2024-01-24 10:09:36 +01:00
Thomas Simonini
4966807d7a Merge pull request #473 from huggingface/ThomasSimonini/HuggyUpdate
Update Huggy
2024-01-24 10:07:16 +01:00
Thomas Simonini
fdf781ed82 Update train.mdx 2024-01-24 10:03:32 +01:00
Thomas Simonini
dab558b310 Update wget with a Github Repository 2024-01-24 10:01:41 +01:00