mirror of
https://github.com/huggingface/deep-rl-class.git
synced 2026-04-07 20:49:45 +08:00
Update hands-on.mdx
This commit is contained in:
@@ -353,7 +353,7 @@ def greedy_policy(Qtable, state):
|
||||
return action
|
||||
```
|
||||
|
||||
##Define the epsilon-greedy policy 🤖
|
||||
## Define the epsilon-greedy policy 🤖
|
||||
|
||||
Epsilon-greedy is the training policy that handles the exploration/exploitation trade-off.
|
||||
|
||||
|
||||
Reference in New Issue
Block a user