Diego Carpintero | 3c9dac69a3 | Add glossary to unit4 | 2023-07-05 16:10:57 +02:00
Kiran Karkera | e30be856bc | Update advantages-disadvantages.mdx. Change remaining references to rose states | 2023-05-09 18:47:59 +05:30
Kiran Karkera | adbd2abb38 | fix for rose/red color tiles naming (#317). Rename rose to red (coloured) state, since red is the preferred usage (as elsewhere in the document: line 37), | 2023-05-09 10:49:37 +05:30
Joe Rowe | 1105929bf8 | docs: update PixelCopter action space notes | 2023-05-05 11:24:28 +01:00
Thomas Simonini | 1f0edd7ddd | Add colab info | 2023-05-03 17:38:55 +02:00
dylwil3 | afb42f18bd | requested change | 2023-05-02 08:39:07 -05:00
Dylan Wilson | 1fc3817d73 | Typos Unit4 | 2023-04-18 16:18:51 -05:00
simoninithomas | d0967799b4 | Update Unit 3 | 2023-02-25 15:21:21 +01:00
simoninithomas | bd378d0319 | Add Leaderboard update | 2023-02-25 15:01:22 +01:00
Vinay Kumar | 704bd156e0 | Minor typo fix | 2023-01-12 15:53:05 -05:00
simoninithomas | a0d86e54a5 | Minor updates | 2023-01-07 18:22:32 +01:00
simoninithomas | e435937214 | Some minor updates | 2023-01-07 18:13:55 +01:00
Thomas Simonini | 26e335736e | Update hands-on.mdx | 2023-01-04 14:27:23 +01:00
Thomas Simonini | 8a35f1bf67 | Update hands-on.mdx | 2023-01-04 14:18:09 +01:00
Thomas Simonini | 89e97f0196 | Update hands-on.mdx | 2023-01-04 14:10:57 +01:00
Thomas Simonini | 49692e07b7 | Apply suggestions from code review. Co-authored-by: Omar Sanseviero <osanseviero@gmail.com> | 2023-01-04 14:02:15 +01:00
Thomas Simonini | 5272fb8941 | Update policy-gradient.mdx | 2023-01-04 14:00:05 +01:00
Thomas Simonini | fabf98b74f | Update what-are-policy-based-methods.mdx | 2023-01-04 13:58:06 +01:00
Thomas Simonini | 2e1e4046a2 | Update quiz.mdx | 2023-01-04 11:30:55 +01:00
Thomas Simonini | 2e49a1fb6f | Update quiz.mdx | 2023-01-04 11:14:36 +01:00
simoninithomas | c32d96dbc8 | Add hands on mdx | 2023-01-04 10:01:54 +01:00
simoninithomas | 851b083fcf | Add the Quiz | 2023-01-04 09:07:09 +01:00
simoninithomas | 5dbb460d90 | Modifications based on Omar feedback + cleanup | 2023-01-04 08:48:30 +01:00
Thomas Simonini | 1c93606aec | Apply suggestions from code review. Co-authored-by: Omar Sanseviero <osanseviero@gmail.com> | 2023-01-04 08:22:31 +01:00
simoninithomas | b94cc104e1 | Typo | 2023-01-03 10:07:58 +01:00
simoninithomas | 8e0bbdb82e | Update maths | 2023-01-03 09:58:54 +01:00
simoninithomas | 53ad3d9a09 | Add derivation optional | 2023-01-03 09:44:20 +01:00
simoninithomas | fc00de7e69 | Add mathematics | 2023-01-03 09:06:28 +01:00
simoninithomas | c458fb33c7 | Update PG and add hands-on | 2023-01-02 22:37:01 +01:00
simoninithomas | e1cf375c36 | Update advantages-disadvantages and policy gradient | 2023-01-02 22:23:27 +01:00
simoninithomas | 88fded6cf3 | Update intro and what are policy based mtd | 2023-01-02 22:05:36 +01:00
simoninithomas | c0c4f9b565 | Add conclusion | 2023-01-02 21:52:41 +01:00
simoninithomas | 7bb90190c7 | Update Policy Gradient | 2023-01-02 21:47:33 +01:00
simoninithomas | bebb6fed17 | Adding conclusion | 2023-01-02 21:22:44 +01:00
simoninithomas | 2d2dffd4f7 | Add illustrations PG | 2023-01-02 20:11:50 +01:00
simoninithomas | 1197198d2b | Added policy gradient section | 2023-01-02 14:43:34 +01:00
simoninithomas | c71422e59c | First draft unfinished | 2023-01-01 13:23:38 +01:00