Commit Graph

227 Commits

Author SHA1 Message Date
Thomas Simonini
f17adcdf55 Merge pull request #173 from huggingface/ThomasSimonini/MLAgents
Add Unit Introduction to ML-Agents
2023-01-10 10:23:53 +01:00
simoninithomas
368a1823b6 Update hands on 2023-01-10 10:19:05 +01:00
simoninithomas
08a92b2e09 Add notebook 2023-01-10 10:11:49 +01:00
simoninithomas
e909adee3f Cleanups 2023-01-08 15:49:22 +01:00
Thomas Simonini
f3b2cb11bc Update introduction.mdx 2023-01-08 12:10:30 +01:00
Thomas Simonini
471cb9ac6a Update snowball-target.mdx 2023-01-08 09:37:31 +01:00
Thomas Simonini
980f620191 Add explanation SnowballTarget 2023-01-08 09:29:49 +01:00
simoninithomas
a0d86e54a5 Minor updates 2023-01-07 18:22:32 +01:00
simoninithomas
e435937214 Some minor updates 2023-01-07 18:13:55 +01:00
Thomas Simonini
888bedbff4 Update hands-on.mdx 2023-01-07 17:55:10 +01:00
Thomas Simonini
19c3876657 Update pyramids.mdx 2023-01-07 17:54:05 +01:00
Thomas Simonini
cd118ad2cc Update pyramids.mdx 2023-01-07 17:50:40 +01:00
Thomas Simonini
b31054486c Update snowball-target.mdx 2023-01-07 17:40:01 +01:00
Thomas Simonini
ad006f116c Update snowball-target.mdx 2023-01-07 17:39:17 +01:00
Thomas Simonini
1e28b345a3 Update snowball-target.mdx 2023-01-07 17:36:56 +01:00
Thomas Simonini
5d88a5b9e8 Update snowball-target.mdx 2023-01-07 17:36:28 +01:00
simoninithomas
bce8ba85ed Update MLAgents 2023-01-07 17:27:14 +01:00
Thomas Simonini
45e3247ea8 Update introduction.mdx 2023-01-07 15:37:12 +01:00
Thomas Simonini
e45058ea10 Update play.mdx 2023-01-07 15:36:27 +01:00
Thomas Simonini
217840b33a Update Huggy Link 2023-01-07 15:34:53 +01:00
Thomas Simonini
3af07bbeac Change Huggy Link 2023-01-07 15:32:59 +01:00
simoninithomas
98f4c85709 Add illustrations 2023-01-07 10:48:28 +01:00
simoninithomas
92dc5ce8eb Add illustrations 2023-01-07 10:46:46 +01:00
simoninithomas
759bf0d113 Updates MLAgents Unit 2023-01-07 10:12:52 +01:00
Thomas Simonini
fb12b509ef Update snowball-target.mdx 2023-01-06 18:01:33 +01:00
Thomas Simonini
583462ff23 Update introduction.mdx 2023-01-06 17:58:22 +01:00
simoninithomas
0d352e4f7a Update environments explanation 2023-01-06 14:27:57 +01:00
simoninithomas
a86695b50e Update snowball target explanation 2023-01-06 14:19:02 +01:00
simoninithomas
8baa4e45b6 Update MLAgents introduction 2023-01-06 14:06:34 +01:00
Vinay Kumar
056f6b054f Minor typo fix 2023-01-04 22:42:43 -05:00
Thomas Simonini
017465ef4c Update hands-on.mdx 2023-01-04 21:25:31 +01:00
Thomas Simonini
816901d50d Merge branch 'main' into ThomasSimonini/MLAgents 2023-01-04 16:21:08 +01:00
Thomas Simonini
26e335736e Update hands-on.mdx 2023-01-04 14:27:23 +01:00
Thomas Simonini
8a35f1bf67 Update hands-on.mdx 2023-01-04 14:18:09 +01:00
Thomas Simonini
89e97f0196 Update hands-on.mdx 2023-01-04 14:10:57 +01:00
Thomas Simonini
49692e07b7 Apply suggestions from code review
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>
2023-01-04 14:02:15 +01:00
Thomas Simonini
5272fb8941 Update policy-gradient.mdx 2023-01-04 14:00:05 +01:00
Thomas Simonini
fabf98b74f Update what-are-policy-based-methods.mdx 2023-01-04 13:58:06 +01:00
Thomas Simonini
2e1e4046a2 Update quiz.mdx 2023-01-04 11:30:55 +01:00
Thomas Simonini
2e49a1fb6f Update quiz.mdx 2023-01-04 11:14:36 +01:00
simoninithomas
c32d96dbc8 Add hands on mdx 2023-01-04 10:01:54 +01:00
simoninithomas
851b083fcf Add the Quiz 2023-01-04 09:07:09 +01:00
simoninithomas
5dbb460d90 Modifications based on Omar feedback + cleanup 2023-01-04 08:48:30 +01:00
Thomas Simonini
1c93606aec Apply suggestions from code review
Co-authored-by: Omar Sanseviero <osanseviero@gmail.com>
2023-01-04 08:22:31 +01:00
simoninithomas
b94cc104e1 Typo 2023-01-03 10:07:58 +01:00
simoninithomas
8e0bbdb82e Update maths 2023-01-03 09:58:54 +01:00
simoninithomas
53ad3d9a09 Add derivation optional 2023-01-03 09:44:20 +01:00
simoninithomas
fc00de7e69 Add mathematics 2023-01-03 09:06:28 +01:00
simoninithomas
c458fb33c7 Update PG and add hands-on 2023-01-02 22:37:01 +01:00
simoninithomas
e1cf375c36 Update advantages-disadvantages and policy gradient 2023-01-02 22:23:27 +01:00