Commit Graph

31 Commits

Author SHA1 Message Date
Balaji Varatharajan
87fcfeb9bb Update variance-problem.mdx
Hi, I have a blog post titled [High Variance in Policy gradients](https://balajiai.github.io/high_variance_in_policy_gradients) which also explains the variance problem in policy gradients and variance-reduction techniques such as baselines and actor-critic methods.
I think it would be valuable to this course's readers, so I'm adding it to the reading list.

Thanks!
2024-02-17 15:16:29 +05:30
Jose J. Martinez
abd4a56c32 Unit 5 quiz and rewording of unit 6 2023-12-06 18:30:51 +00:00
Juan Martinez
f41bf2c5fb Fixes missing commas 2023-12-06 11:36:29 +00:00
Juan Martinez
f7c510a063 Adds newline after ### 2023-12-06 11:12:38 +00:00
Juan Martinez
40cf7684e5 Fixes typo and comma(s) 2023-12-06 11:10:43 +00:00
Juan Martinez
57678da563 Update quiz.mdx 2023-12-03 19:16:18 +00:00
Juan Martinez
a31043822e Create quiz for unit 6 2023-12-03 18:58:37 +00:00
Thomas Simonini
7b4c6d480d Update hands-on.mdx 2023-08-18 08:43:54 +02:00
Thomas Simonini
d430db9ea3 Update hands-on.mdx 2023-08-06 18:23:53 +02:00
Thomas Simonini
32e5a31853 Merge branch 'main' into GymnasiumUpdate/Unit6 2023-08-06 18:14:10 +02:00
Thomas Simonini
e2ab2ee38f Update hands-on.mdx 2023-08-06 18:11:54 +02:00
Thomas Simonini
ca42ab49f8 Update introduction.mdx
* Remove gif
2023-08-06 18:10:42 +02:00
Thomas Simonini
285fc72e2b Update hands-on.mdx 2023-07-19 14:35:26 +02:00
Thomas Simonini
416ec655d0 Update (gymnasium) 2023-05-10 08:41:27 +02:00
Dylan Wilson
4b9cb12bdc Typos Unit6 2023-04-19 10:21:13 -05:00
Andrey Voroshilov
493ddce187 Fixed env name in one of the code blocks 2023-03-13 02:22:36 -07:00
Thomas Simonini
d72a886ef0 Merge pull request #231 from huggingface/ThomasSimonini/SundayUpdate
Sunday Update of the Course
2023-02-25 18:45:56 +01:00
Thomas Simonini
1727d54eeb Update hands-on.mdx 2023-02-25 18:24:21 +01:00
simoninithomas
f744071184 Update Actor Critic 2023-02-25 15:23:02 +01:00
simoninithomas
bd378d0319 Add Leaderboard update 2023-02-25 15:01:22 +01:00
Thomas Simonini
9caf7e2759 Update hands-on.mdx 2023-01-17 14:44:13 +01:00
Thomas Simonini
770adfdd2b Apply suggestions from code review
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>
2023-01-17 14:31:28 +01:00
Thomas Simonini
87c33d790b Update advantage-actor-critic.mdx 2023-01-17 14:23:14 +01:00
Thomas Simonini
ae37a884ed Update advantage-actor-critic.mdx 2023-01-17 08:08:58 +01:00
simoninithomas
28ef99046d Finalize A2C 2023-01-17 07:47:05 +01:00
Thomas Simonini
2a35c66ec5 Apply suggestions from code review
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>
2023-01-16 18:08:36 +01:00
Thomas Simonini
f937f8c7db Update introduction.mdx 2023-01-02 10:26:55 +01:00
simoninithomas
14bd94d574 Update conclusion 2023-01-01 17:29:07 +01:00
Thomas Simonini
b835b898fc Update conclusion.mdx 2022-12-31 20:36:44 +01:00
simoninithomas
143f169a65 Adding reading resources 2022-12-30 19:05:40 +01:00
simoninithomas
5aeaf3b5c4 Adding updated A2C Unit 2022-12-30 19:01:28 +01:00