mirror of
https://github.com/huggingface/deep-rl-class.git
synced 2026-06-15 06:27:24 +08:00
docs: update PixelCopter action space notes
This commit is contained in:
@@ -888,8 +888,8 @@ The observation space (7) 👀:
|
||||
- next blocks bottom y location
|
||||
|
||||
The action space(2) 🎮:
|
||||
- Up
|
||||
- Down
|
||||
- Up (press accelerator)
|
||||
- Do nothing (don't press accelerator)
|
||||
|
||||
The reward function 💰:
|
||||
- For each vertical block it passes, it gains a positive reward of +1. Each time a terminal state is reached it receives a negative reward of -1.
|
||||
|
||||
Reference in New Issue
Block a user