diff --git a/units/en/unit5/pyramids.mdx b/units/en/unit5/pyramids.mdx index 4ddf267..8983692 100644 --- a/units/en/unit5/pyramids.mdx +++ b/units/en/unit5/pyramids.mdx @@ -2,14 +2,14 @@ The goal in this environment is to train our agent to **get the gold brick on the top of the Pyramid. In order to do that, it needs to press a button to spawn a pyramid, navigate to the Pyramid, knock it over, and move to the gold brick at the top**. -Pyramids Environment +Pyramids Environment ## The reward function The reward function is: -Pyramids Environment +Pyramids Environment To train this new agent that seeks that button and then the Pyramid to destroy, we’ll use a combination of two types of rewards: