mirror of
https://github.com/huggingface/deep-rl-class.git
synced 2026-04-01 17:51:01 +08:00
Update rl-framework.mdx
This commit is contained in:
@@ -14,7 +14,7 @@ To understand the RL process, let’s imagine an agent learning to play a platfo
|
||||
|
||||
\$\sqrt{2}\$
|
||||
|
||||
- Our Agent receives **state $S_0$** from the **Environment** — we receive the first frame of our game (Environment).
|
||||
- Our Agent receives **state \\(S_0\\)** from the **Environment** — we receive the first frame of our game (Environment).
|
||||
- Based on that **state \\(S_0\\),** the Agent takes **action \\(A_0\\)** — our Agent will move to the right.
|
||||
- Environment goes to a **new** **state \\(S_1\\)** — new frame.
|
||||
- The environment gives some **reward \\(R_1\\)** to the Agent — we’re not dead *(Positive Reward +1)*.
|
||||
|
||||
Reference in New Issue
Block a user