mirror of
https://github.com/huggingface/deep-rl-class.git
synced 2026-04-08 04:59:36 +08:00
Merge pull request #477 from RichardKhanhWin/unit1-typo-fix
Unit 1 typo fixes
This commit is contained in:
@@ -518,7 +518,7 @@
|
||||
"- Action 2: Fire the main engine,\n",
|
||||
"- Action 3: Fire right orientation engine.\n",
|
||||
"\n",
|
||||
"Reward function (the function that will gives a reward at each timestep) 💰:\n",
|
||||
"Reward function (the function that will give a reward at each timestep) 💰:\n",
|
||||
"\n",
|
||||
"After every step a reward is granted. The total reward of an episode is the **sum of the rewards for all the steps within that episode**.\n",
|
||||
"\n",
|
||||
|
||||
@@ -332,7 +332,7 @@ The action space (the set of possible actions the agent can take) is discrete wi
|
||||
- Action 2: Fire the main engine,
|
||||
- Action 3: Fire right orientation engine.
|
||||
|
||||
Reward function (the function that will gives a reward at each timestep) 💰:
|
||||
Reward function (the function that will give a reward at each timestep) 💰:
|
||||
|
||||
After every step a reward is granted. The total reward of an episode is the **sum of the rewards for all the steps within that episode**.
|
||||
|
||||
|
||||
Reference in New Issue
Block a user