Change action-value wording in glossary to match previous wording

This commit is contained in:
PierreCounathe
2023-08-27 23:07:54 +02:00
parent 207ca5b9fd
commit eed8941ff5

View File

@@ -11,7 +11,7 @@ This is a community-created glossary. Contributions are welcomed!
### Among the value-based methods, we can find two main strategies
- **The state-value function.** For each state, the state-value function is the expected return if the agent starts in that state and follows the policy until the end.
- **The action-value function.** In contrast to the state-value function, the action-value calculates for each state and action pair the expected return if the agent starts in that state and takes an action. Then it follows the policy forever after.
- **The action-value function.** In contrast to the state-value function, the action-value calculates for each state and action pair the expected return if the agent starts in that state, takes that action, and then follows the policy forever after.
### Epsilon-greedy strategy: