Lutz von der Burchard
|
ca29ddfbf9
|
Update mid-way-recap.mdx
Compare issue 451 (https://github.com/huggingface/deep-rl-class/issues/451)
|
2024-01-15 09:45:50 +01:00 |
|
Thomas Simonini
|
32d5564236
|
Merge pull request #454 from lutzvdb/patch-1
Added clarification to the meaning of the rows of the Q-table
|
2024-01-15 09:42:01 +01:00 |
|
Lutz von der Burchard
|
162110aba9
|
Added clarification to the meaning of the rows of the Q-table
|
2023-12-28 11:00:33 +01:00 |
|
Varun Sappa
|
a8dfa2bbd0
|
Update hands-on.mdx
|
2023-12-18 12:28:48 +05:30 |
|
Thomas Simonini
|
94c0a413dd
|
Merge pull request #395 from PierreCounathe/pierrecounathe/unit-2-frozenlake-observation-space
Unit 2 Correct FrozenLake's observation space
|
2023-09-13 11:27:16 +02:00 |
|
PierreCounathe
|
a14476a265
|
Correct FrozenLake's observation space
|
2023-08-30 10:24:50 +02:00 |
|
PierreCounathe
|
05188727f3
|
Proposal on small details...
|
2023-08-27 23:09:52 +02:00 |
|
PierreCounathe
|
eed8941ff5
|
Change action-value wording in glossary to match previous wording
|
2023-08-27 23:07:54 +02:00 |
|
Thomas Simonini
|
c8f62d1d46
|
Update hands-on.mdx
|
2023-08-17 10:37:31 +02:00 |
|
Katz, Ilia (ik216a)
|
bc9a54adcf
|
Fix typo in glossary.mdx
|
2023-08-15 14:23:45 +03:00 |
|
Katz, Ilia (ik216a)
|
3760815834
|
Add MC and TD to Unit2 glossary
|
2023-08-11 19:24:20 +03:00 |
|
Thomas Simonini
|
352b33315f
|
Update hands-on.mdx
|
2023-08-08 11:31:41 +02:00 |
|
Thomas Simonini
|
6425c0da80
|
Merge branch 'main' into 333-Monte-Carlo-Gt
|
2023-08-05 15:03:06 +02:00 |
|
Katz, Ilia (ik216a)
|
3fec5a9b14
|
Remove duplication in mc-vs-td.mdx
|
2023-08-04 21:34:11 +02:00 |
|
Wiktor Wójcik
|
083b4c3856
|
fix(unit2/hands-on): Fix a small formatting issue.
|
2023-07-02 19:13:32 +02:00 |
|
Andy Walner
|
255112a2eb
|
random_num
|
2023-06-26 07:18:21 -07:00 |
|
Paul Adamson
|
e6f921b614
|
clarify Gt=0 calculation
|
2023-05-29 09:07:12 -04:00 |
|
Thomas Simonini
|
c8fe275665
|
Update hands-on.mdx
|
2023-05-04 07:01:20 +02:00 |
|
Thomas Simonini
|
f92a952349
|
Update hands-on.mdx
|
2023-05-04 06:54:07 +02:00 |
|
Thomas Simonini
|
d46598ff1c
|
Update hands-on.mdx
* Gymnasium Update
|
2023-05-04 06:48:15 +02:00 |
|
Dylan Wilson
|
85b932e409
|
Typos Unit2
|
2023-04-18 15:05:47 -05:00 |
|
Thomas Simonini
|
8b74360a47
|
Merge pull request #263 from arkadyark/policy-algorithms-unit-2
Add on and off policy algorithms to glossary
|
2023-04-05 09:50:30 +02:00 |
|
Arkady Arkhangorodsky
|
35c3818ed1
|
Add on and off policy algorithms to glossary
|
2023-03-28 23:19:45 -04:00 |
|
Thomas Simonini
|
d041fd29ea
|
Update hands-on.mdx
|
2023-02-25 18:16:01 +01:00 |
|
simoninithomas
|
bd378d0319
|
Add Leaderboard update
|
2023-02-25 15:01:22 +01:00 |
|
César Rodríguez
|
3ff8c4e3c2
|
typo
|
2023-02-05 19:57:05 +01:00 |
|
simoninithomas
|
e435937214
|
Some minor updates
|
2023-01-07 18:13:55 +01:00 |
|
Thomas Simonini
|
bc9bb6c52f
|
Merge pull request #170 from HasarinduPerera/main
Update glossary.mdx [Unit 2]
|
2022-12-31 21:46:20 +01:00 |
|
Thomas Simonini
|
9b531c3be0
|
Some small updates
|
2022-12-31 20:52:40 +01:00 |
|
Hasarindu Perera
|
815ae5ba13
|
Update glossary.mdx
Add Epsilon-greedy strategy and Greedy strategy.
|
2022-12-31 13:30:42 +05:30 |
|
Thomas Simonini
|
7b61d9f813
|
Update bellman-equation.mdx
|
2022-12-20 14:20:40 +01:00 |
|
Thomas Simonini
|
5f66e67419
|
Update mc-vs-td.mdx
|
2022-12-20 14:06:10 +01:00 |
|
Thomas Simonini
|
3bdc44cd35
|
Update bellman-equation.mdx
|
2022-12-20 14:05:29 +01:00 |
|
Thomas Simonini
|
beaef9b0a4
|
Update two-types-value-based-methods.mdx
|
2022-12-20 14:02:46 +01:00 |
|
Thomas Simonini
|
093bdb1ed8
|
Merge pull request #137 from ramon-rd/patch-1
Create glossary.mdx
|
2022-12-20 13:07:07 +01:00 |
|
Thomas Simonini
|
a37804cebf
|
Update glossary.mdx
|
2022-12-20 13:06:31 +01:00 |
|
Artagon
|
fc66ea7e4a
|
Rephrasing for initial epsilon value
|
2022-12-17 22:33:02 +01:00 |
|
Artagon
|
96714cdb10
|
Cases consistency
|
2022-12-17 22:23:08 +01:00 |
|
Artagon
|
a7d74befb0
|
Fix midsentence uppercase 'Policy'
|
2022-12-17 14:47:18 +01:00 |
|
Artagon
|
753ef67eae
|
epsilon-greedy instead of epsilon greedy
|
2022-12-17 14:45:08 +01:00 |
|
Artagon
|
f913af7300
|
epsilon smaller or equal to 1.0
|
2022-12-17 14:39:40 +01:00 |
|
Artagon
|
0a4c6c6f2c
|
fix redundant 'pair' and inconsistent Case.
|
2022-12-17 14:30:19 +01:00 |
|
Artagon
|
0c3616c03f
|
Replace ** by <b> tags in figcaption
|
2022-12-16 20:34:24 +01:00 |
|
Artagon
|
0744d542ad
|
Properly display π*
|
2022-12-16 20:31:49 +01:00 |
|
Ramón Rueda
|
7dd332d713
|
Create glossary.mdx
|
2022-12-15 17:44:42 +01:00 |
|
Thomas Simonini
|
55deaa576c
|
Add certification info in hands-on and introduction
|
2022-12-15 16:03:34 +01:00 |
|
ankandrew
|
f46ce5d5c9
|
Fix minor bold text issue
|
2022-12-15 11:01:56 -03:00 |
|
Thomas Simonini
|
95b869b3e6
|
Update hands-on.mdx
|
2022-12-13 11:07:58 +01:00 |
|
Thomas Simonini
|
2ca9a92002
|
Update hands-on.mdx
|
2022-12-13 10:59:34 +01:00 |
|
Thomas Simonini
|
35088e1f59
|
Update Bellman Latex equation quiz
|
2022-12-12 20:53:14 +01:00 |
|