Commit Graph

65 Commits

Author SHA1 Message Date
Lutz von der Burchard
ca29ddfbf9 Update mid-way-recap.mdx
Compare issue 451 (https://github.com/huggingface/deep-rl-class/issues/451)
2024-01-15 09:45:50 +01:00
Thomas Simonini
32d5564236 Merge pull request #454 from lutzvdb/patch-1
Added clarification to the meaning of the rows of the Q-table
2024-01-15 09:42:01 +01:00
Lutz von der Burchard
162110aba9 Added clarification to the meaning of the rows of the Q-table 2023-12-28 11:00:33 +01:00
Varun Sappa
a8dfa2bbd0 Update hands-on.mdx 2023-12-18 12:28:48 +05:30
Thomas Simonini
94c0a413dd Merge pull request #395 from PierreCounathe/pierrecounathe/unit-2-frozenlake-observation-space
Unit 2 Correct FrozenLake's observation space
2023-09-13 11:27:16 +02:00
PierreCounathe
a14476a265 Correct FrozenLake's observation space 2023-08-30 10:24:50 +02:00
PierreCounathe
05188727f3 Proposal on small details... 2023-08-27 23:09:52 +02:00
PierreCounathe
eed8941ff5 Change action-value wording in glossary to match previous wording 2023-08-27 23:07:54 +02:00
Thomas Simonini
c8f62d1d46 Update hands-on.mdx 2023-08-17 10:37:31 +02:00
Katz, Ilia (ik216a)
bc9a54adcf Fix typo in glossary.mdx 2023-08-15 14:23:45 +03:00
Katz, Ilia (ik216a)
3760815834 Add MC and TD to Unit2 glossary 2023-08-11 19:24:20 +03:00
Thomas Simonini
352b33315f Update hands-on.mdx 2023-08-08 11:31:41 +02:00
Thomas Simonini
6425c0da80 Merge branch 'main' into 333-Monte-Carlo-Gt 2023-08-05 15:03:06 +02:00
Katz, Ilia (ik216a)
3fec5a9b14 Remove duplication in mc-vs-td.mdx 2023-08-04 21:34:11 +02:00
Wiktor Wójcik
083b4c3856 fix(unit2/hands-on): Fix a small formatting issue. 2023-07-02 19:13:32 +02:00
Andy Walner
255112a2eb random_num 2023-06-26 07:18:21 -07:00
Paul Adamson
e6f921b614 clarify Gt=0 calculation 2023-05-29 09:07:12 -04:00
Thomas Simonini
c8fe275665 Update hands-on.mdx 2023-05-04 07:01:20 +02:00
Thomas Simonini
f92a952349 Update hands-on.mdx 2023-05-04 06:54:07 +02:00
Thomas Simonini
d46598ff1c Update hands-on.mdx
* Gymnasium Update
2023-05-04 06:48:15 +02:00
Dylan Wilson
85b932e409 Typos Unit2 2023-04-18 15:05:47 -05:00
Thomas Simonini
8b74360a47 Merge pull request #263 from arkadyark/policy-algorithms-unit-2
Add on and off policy algorithms to glossary
2023-04-05 09:50:30 +02:00
Arkady Arkhangorodsky
35c3818ed1 Add on and off policy algorithms to glossary 2023-03-28 23:19:45 -04:00
Thomas Simonini
d041fd29ea Update hands-on.mdx 2023-02-25 18:16:01 +01:00
simoninithomas
bd378d0319 Add Leaderboard update 2023-02-25 15:01:22 +01:00
César Rodríguez
3ff8c4e3c2 typo 2023-02-05 19:57:05 +01:00
simoninithomas
e435937214 Some minor updates 2023-01-07 18:13:55 +01:00
Thomas Simonini
bc9bb6c52f Merge pull request #170 from HasarinduPerera/main
Update glossary.mdx [Unit 2]
2022-12-31 21:46:20 +01:00
Thomas Simonini
9b531c3be0 Some small updates 2022-12-31 20:52:40 +01:00
Hasarindu Perera
815ae5ba13 Update glossary.mdx
Add Epsilon-greedy strategy and Greedy strategy.
2022-12-31 13:30:42 +05:30
Thomas Simonini
7b61d9f813 Update bellman-equation.mdx 2022-12-20 14:20:40 +01:00
Thomas Simonini
5f66e67419 Update mc-vs-td.mdx 2022-12-20 14:06:10 +01:00
Thomas Simonini
3bdc44cd35 Update bellman-equation.mdx 2022-12-20 14:05:29 +01:00
Thomas Simonini
beaef9b0a4 Update two-types-value-based-methods.mdx 2022-12-20 14:02:46 +01:00
Thomas Simonini
093bdb1ed8 Merge pull request #137 from ramon-rd/patch-1
Create glossary.mdx
2022-12-20 13:07:07 +01:00
Thomas Simonini
a37804cebf Update glossary.mdx 2022-12-20 13:06:31 +01:00
Artagon
fc66ea7e4a Rephrasing for initial epsilon value 2022-12-17 22:33:02 +01:00
Artagon
96714cdb10 Cases consistency 2022-12-17 22:23:08 +01:00
Artagon
a7d74befb0 Fix midsentence uppercase 'Policy' 2022-12-17 14:47:18 +01:00
Artagon
753ef67eae epsilon-greedy instead of epsilon greedy 2022-12-17 14:45:08 +01:00
Artagon
f913af7300 epsilon smaller or equal to 1.0 2022-12-17 14:39:40 +01:00
Artagon
0a4c6c6f2c fix redundant 'pair' and inconsistent Case. 2022-12-17 14:30:19 +01:00
Artagon
0c3616c03f Replace ** by <b> tags in figcaption 2022-12-16 20:34:24 +01:00
Artagon
0744d542ad Properly display π* 2022-12-16 20:31:49 +01:00
Ramón Rueda
7dd332d713 Create glossary.mdx 2022-12-15 17:44:42 +01:00
Thomas Simonini
55deaa576c Add certification info in hands-on and introduction 2022-12-15 16:03:34 +01:00
ankandrew
f46ce5d5c9 Fix minor bold text issue 2022-12-15 11:01:56 -03:00
Thomas Simonini
95b869b3e6 Update hands-on.mdx 2022-12-13 11:07:58 +01:00
Thomas Simonini
2ca9a92002 Update hands-on.mdx 2022-12-13 10:59:34 +01:00
Thomas Simonini
35088e1f59 Update Bellman Latex equation quiz 2022-12-12 20:53:14 +01:00