mirror of
https://github.com/huggingface/deep-rl-class.git
synced 2026-05-12 02:46:37 +08:00
@@ -6,7 +6,7 @@ A task is an **instance** of a Reinforcement Learning problem. We can have two t
 
 In this case, we have a starting point and an ending point **(a terminal state). This creates an episode**: a list of States, Actions, Rewards, and new States.
 
-For instance, think about Super Mario Bros: an episode begin at the launch of a new Mario Level and ends **when you’re killed or you reached the end of the level.**
+For instance, think about Super Mario Bros: an episode begins at the launch of a new Mario Level and ends **when you’re killed or you reached the end of the level.**
 
 <figure>
 <img src="https://huggingface.co/datasets/huggingface-deep-rl-course/course-images/resolve/main/en/unit1/mario.jpg" alt="Mario">