mirror of
https://github.com/huggingface/deep-rl-class.git
synced 2026-04-09 13:50:23 +08:00
Remove duplication in mc-vs-td.mdx
This commit is contained in:
@@ -58,7 +58,6 @@ For instance, if we train a state-value function using Monte Carlo:
|
||||
|
||||
|
||||
- We have a list of state, action, rewards, next_state, **we need to calculate the return \\(G{t}\\)**
|
||||
- \\(G_t = R_{t+1} + R_{t+2} + R_{t+3} ...\\)
|
||||
- \\(G_t = R_{t+1} + R_{t+2} + R_{t+3}…\\) (for simplicity we don’t discount the rewards).
|
||||
- \\(G_t = 1 + 0 + 0 + 0+ 0 + 0 + 1 + 1 + 0 + 0\\)
|
||||
- \\(G_t= 3\\)
|
||||
|
||||
Reference in New Issue
Block a user