mirror of
https://github.com/huggingface/deep-rl-class.git
synced 2026-04-09 21:59:22 +08:00
Update environments explanation
@@ -23,7 +23,9 @@ If you want to know more about curiosity, the next section (optional) will expla
In terms of observation, we **use 148 raycasts that can each detect objects** (switch, bricks, golden brick, and walls.)
-We also use a **boolean variable indicating the switch state** (did we turn on or not the switch to spawn the Pyramid) and a vector that **contains agent’ speed**.
+We also use a **boolean variable indicating the switch state** (did we turn on or not the switch to spawn the Pyramid) and a vector that **contains the agent’s speed**.
ADD SCREENSHOT CODE
## The action space
@@ -19,7 +19,7 @@ TODO ADD IMAGE REWARD
Regarding observations, we don't use normal vision (frame), but we use raycasts.
-TOOD ADD raycasts that can each detect objects (target, walls)
+TOOD ADD raycasts that can each detect objects (target, walls) and how much we have
Think of raycasts as lasers that will detect if it passes through an object.