mirror of
https://github.com/huggingface/deep-rl-class.git
synced 2026-04-14 18:31:36 +08:00
Fix typo in glossary.mdx
This commit is contained in:
@@ -34,7 +34,7 @@ This is a community-created glossary. Contributions are welcomed!
|
||||
|
||||
### Monte Carlo and Temporal Difference learning strategies
|
||||
|
||||
- **Monte Carlo (MC):** Learning at the end of the episode. With Monte Carlo, we wait until the episode ends and then we update the value functin (or policy function) from a complete episode.
|
||||
- **Monte Carlo (MC):** Learning at the end of the episode. With Monte Carlo, we wait until the episode ends and then we update the value function (or policy function) from a complete episode.
|
||||
|
||||
- **Temporal Difference (TD):** Learning at each step. With Temporal Difference Learning, we update the value function (or policy function) at each step without requiring a complete episode.
|
||||
|
||||
|
||||
Reference in New Issue
Block a user