mirror of
https://github.com/huggingface/deep-rl-class.git
synced 2026-02-08 12:54:32 +08:00
Fix midsentence uppercase 'Policy'
This commit is contained in:
@@ -144,7 +144,7 @@ Is different from the policy we use during the training part:
|
||||
|
||||
- *On-policy:* using the **same policy for acting and updating.**
|
||||
|
||||
For instance, with Sarsa, another value-based algorithm, **the epsilon-greedy Policy selects the next state-action pair, not a greedy policy.**
|
||||
For instance, with Sarsa, another value-based algorithm, **the epsilon-greedy policy selects the next state-action pair, not a greedy policy.**
|
||||
|
||||
|
||||
<figure>
|
||||
|
||||
Reference in New Issue
Block a user