From cd118ad2cc36303c079856ebf452b6baf9d5d33b Mon Sep 17 00:00:00 2001 From: Thomas Simonini Date: Sat, 7 Jan 2023 17:50:40 +0100 Subject: [PATCH] Update pyramids.mdx --- units/en/unit5/pyramids.mdx | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/units/en/unit5/pyramids.mdx b/units/en/unit5/pyramids.mdx index 4ddf267..8983692 100644 --- a/units/en/unit5/pyramids.mdx +++ b/units/en/unit5/pyramids.mdx @@ -2,14 +2,14 @@ The goal in this environment is to train our agent to **get the gold brick on the top of the Pyramid. In order to do that, it needs to press a button to spawn a pyramid, navigate to the Pyramid, knock it over, and move to the gold brick at the top**. -Pyramids Environment +Pyramids Environment ## The reward function The reward function is: -Pyramids Environment +Pyramids Environment To train this new agent that seeks that button and then the Pyramid to destroy, we’ll use a combination of two types of rewards: