Commit Graph

37 Commits

Author SHA1 Message Date
Diego Carpintero
3c9dac69a3 Add glossary to unit4 2023-07-05 16:10:57 +02:00
Kiran Karkera
e30be856bc Update advantages-disadvantages.mdx
Change remaining references to rose states
2023-05-09 18:47:59 +05:30
Kiran Karkera
adbd2abb38 fix for rose/red color tiles naming (#317)
Rename rose to red (coloured) state, since red is the preferred usage (as elsewhere in the document: line 37),
2023-05-09 10:49:37 +05:30
Joe Rowe
1105929bf8 docs: update PixelCopter action space notes 2023-05-05 11:24:28 +01:00
Thomas Simonini
1f0edd7ddd Add colab info 2023-05-03 17:38:55 +02:00
dylwil3
afb42f18bd requested change 2023-05-02 08:39:07 -05:00
Dylan Wilson
1fc3817d73 Typos Unit4 2023-04-18 16:18:51 -05:00
simoninithomas
d0967799b4 Update Unit 3 2023-02-25 15:21:21 +01:00
simoninithomas
bd378d0319 Add Leaderboard update 2023-02-25 15:01:22 +01:00
Vinay Kumar
704bd156e0 Minor typo fix 2023-01-12 15:53:05 -05:00
simoninithomas
a0d86e54a5 Minor updates 2023-01-07 18:22:32 +01:00
simoninithomas
e435937214 Some minor updates 2023-01-07 18:13:55 +01:00
Thomas Simonini
26e335736e Update hands-on.mdx 2023-01-04 14:27:23 +01:00
Thomas Simonini
8a35f1bf67 Update hands-on.mdx 2023-01-04 14:18:09 +01:00
Thomas Simonini
89e97f0196 Update hands-on.mdx 2023-01-04 14:10:57 +01:00
Thomas Simonini
49692e07b7 Apply suggestions from code review
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>
2023-01-04 14:02:15 +01:00
Thomas Simonini
5272fb8941 Update policy-gradient.mdx 2023-01-04 14:00:05 +01:00
Thomas Simonini
fabf98b74f Update what-are-policy-based-methods.mdx 2023-01-04 13:58:06 +01:00
Thomas Simonini
2e1e4046a2 Update quiz.mdx 2023-01-04 11:30:55 +01:00
Thomas Simonini
2e49a1fb6f Update quiz.mdx 2023-01-04 11:14:36 +01:00
simoninithomas
c32d96dbc8 Add hands on mdx 2023-01-04 10:01:54 +01:00
simoninithomas
851b083fcf Add the Quiz 2023-01-04 09:07:09 +01:00
simoninithomas
5dbb460d90 Modifications based on Omar feedback + cleanup 2023-01-04 08:48:30 +01:00
Thomas Simonini
1c93606aec Apply suggestions from code review
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>
2023-01-04 08:22:31 +01:00
simoninithomas
b94cc104e1 Typo 2023-01-03 10:07:58 +01:00
simoninithomas
8e0bbdb82e Update maths 2023-01-03 09:58:54 +01:00
simoninithomas
53ad3d9a09 Add derivation optional 2023-01-03 09:44:20 +01:00
simoninithomas
fc00de7e69 Add mathematics 2023-01-03 09:06:28 +01:00
simoninithomas
c458fb33c7 Update PG and add hands-on 2023-01-02 22:37:01 +01:00
simoninithomas
e1cf375c36 Update advantages-disadvantages and policy gradient 2023-01-02 22:23:27 +01:00
simoninithomas
88fded6cf3 Update intro and what are policy based mtd 2023-01-02 22:05:36 +01:00
simoninithomas
c0c4f9b565 Add conclusion 2023-01-02 21:52:41 +01:00
simoninithomas
7bb90190c7 Update Policy Gradient 2023-01-02 21:47:33 +01:00
simoninithomas
bebb6fed17 Adding conclusion 2023-01-02 21:22:44 +01:00
simoninithomas
2d2dffd4f7 Add illustrations PG 2023-01-02 20:11:50 +01:00
simoninithomas
1197198d2b Added policy gradient section 2023-01-02 14:43:34 +01:00
simoninithomas
c71422e59c First draft unfinished 2023-01-01 13:23:38 +01:00