Balaji Varatharajan
87fcfeb9bb
Update variance-problem.mdx
Hi, I have a blog post titled [High Variance in Policy gradients](https://balajiai.github.io/high_variance_in_policy_gradients), which explains the variance problem in policy gradients and variance-reduction techniques such as baselines and actor-critic methods.
I think it would be valuable to readers of this course, so I'm adding it to the reading list.
Thanks!
2024-02-17 15:16:29 +05:30
Jose J. Martinez
abd4a56c32
Unit 5 quiz and rewording of unit 6
2023-12-06 18:30:51 +00:00
Juan Martinez
f41bf2c5fb
Fixes missing commas
2023-12-06 11:36:29 +00:00
Juan Martinez
f7c510a063
Adds newline after ###
2023-12-06 11:12:38 +00:00
Juan Martinez
40cf7684e5
Fixes typo and comma(s)
2023-12-06 11:10:43 +00:00
Juan Martinez
57678da563
Update quiz.mdx
2023-12-03 19:16:18 +00:00
Juan Martinez
a31043822e
Create quiz for unit 6
2023-12-03 18:58:37 +00:00
Thomas Simonini
7b4c6d480d
Update hands-on.mdx
2023-08-18 08:43:54 +02:00
Thomas Simonini
d430db9ea3
Update hands-on.mdx
2023-08-06 18:23:53 +02:00
Thomas Simonini
32e5a31853
Merge branch 'main' into GymnasiumUpdate/Unit6
2023-08-06 18:14:10 +02:00
Thomas Simonini
e2ab2ee38f
Update hands-on.mdx
2023-08-06 18:11:54 +02:00
Thomas Simonini
ca42ab49f8
Update introduction.mdx
* Remove gif
2023-08-06 18:10:42 +02:00
Thomas Simonini
285fc72e2b
Update hands-on.mdx
2023-07-19 14:35:26 +02:00
Thomas Simonini
416ec655d0
Update (gymnasium)
2023-05-10 08:41:27 +02:00
Dylan Wilson
4b9cb12bdc
Typos Unit6
2023-04-19 10:21:13 -05:00
Andrey Voroshilov
493ddce187
Fixed env name in one of the code blocks
2023-03-13 02:22:36 -07:00
Thomas Simonini
d72a886ef0
Merge pull request #231 from huggingface/ThomasSimonini/SundayUpdate
Sunday Update of the Course
2023-02-25 18:45:56 +01:00
Thomas Simonini
1727d54eeb
Update hands-on.mdx
2023-02-25 18:24:21 +01:00
simoninithomas
f744071184
Update Actor Critic
2023-02-25 15:23:02 +01:00
simoninithomas
bd378d0319
Add Leaderboard update
2023-02-25 15:01:22 +01:00
Thomas Simonini
9caf7e2759
Update hands-on.mdx
2023-01-17 14:44:13 +01:00
Thomas Simonini
770adfdd2b
Apply suggestions from code review
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>
2023-01-17 14:31:28 +01:00
Thomas Simonini
87c33d790b
Update advantage-actor-critic.mdx
2023-01-17 14:23:14 +01:00
Thomas Simonini
ae37a884ed
Update advantage-actor-critic.mdx
2023-01-17 08:08:58 +01:00
simoninithomas
28ef99046d
Finalize A2C
2023-01-17 07:47:05 +01:00
Thomas Simonini
2a35c66ec5
Apply suggestions from code review
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>
2023-01-16 18:08:36 +01:00
Thomas Simonini
f937f8c7db
Update introduction.mdx
2023-01-02 10:26:55 +01:00
simoninithomas
14bd94d574
Update conclusion
2023-01-01 17:29:07 +01:00
Thomas Simonini
b835b898fc
Update conclusion.mdx
2022-12-31 20:36:44 +01:00
simoninithomas
143f169a65
Adding reading resources
2022-12-30 19:05:40 +01:00
simoninithomas
5aeaf3b5c4
Adding updated A2C Unit
2022-12-30 19:01:28 +01:00