diff --git a/units/en/unit1/tasks.mdx b/units/en/unit1/tasks.mdx
index a5e7e05..cfb4d86 100644
--- a/units/en/unit1/tasks.mdx
+++ b/units/en/unit1/tasks.mdx
@@ -6,7 +6,7 @@ A task is an **instance** of a Reinforcement Learning problem. We can have two t
 In this case, we have a starting point and an ending point **(a terminal state). This creates an episode**: a list of States, Actions, Rewards, and new States.
-For instance, think about Super Mario Bros: an episode begin at the launch of a new Mario Level and ends **when you’re killed or you reached the end of the level.**
+For instance, think about Super Mario Bros: an episode begins at the launch of a new Mario Level and ends **when you’re killed or you reached the end of the level.**
Mario