mirror of
https://github.com/huggingface/deep-rl-class.git
synced 2026-04-04 02:57:58 +08:00
Clear up some unneeded and confusing parts of Unit 2
@@ -315,7 +315,7 @@
 "outputs": [],
 "source": [
 "# Create the FrozenLake-v1 environment using 4x4 map and non-slippery version\n",
-"env = gym.make()"
+"env = gym.make() # TODO use the correct parameters"
 ]
 },
 {
@@ -344,7 +344,7 @@
 "id": "KASNViqL4tZn"
 },
 "source": [
-"You can also custom your own grid using:\n",
+"You can create your own custom grid like this:\n",
 "\n",
 "```python\n",
 "desc=[\"SFFF\", \"FHFH\", \"FFFH\", \"HFFG\"]\n",
@@ -389,7 +389,7 @@
 "For example, the goal position in the 4x4 map can be calculated as follows: 3 * 4 + 3 = 15. The number of possible observations is dependent on the size of the map. **For example, the 4x4 map has 16 possible observations.**\n",
 "\n",
 "\n",
-"For instance, at this state = 0\n"
+"For instance, this is what state = 0 looks like:\n"
 ]
 },
 {
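The state numbering quoted in the hunk above (3 * 4 + 3 = 15 for the goal of the 4x4 map) reads as a row-major index: row * number_of_columns + column. A minimal sketch under that assumption follows. The helpers `state_index` and `state_position` are hypothetical names used for illustration only; they are not part of the notebook or of Gym.

```python
def state_index(row, col, ncols=4):
    """Flatten a (row, col) grid position into a single state number (row-major)."""
    return row * ncols + col


def state_position(state, ncols=4):
    """Inverse mapping: recover (row, col) from a flat state number."""
    return divmod(state, ncols)


# Grid taken from the custom-map hunk earlier in this diff.
desc = ["SFFF", "FHFH", "FFFH", "HFFG"]

# Locate the goal tile "G" and convert its position to a state number.
goal_row = next(r for r, line in enumerate(desc) if "G" in line)
goal_col = desc[goal_row].index("G")
goal_state = state_index(goal_row, goal_col, ncols=len(desc[0]))  # 3 * 4 + 3 = 15

# Total number of observations is rows * columns.
n_observations = len(desc) * len(desc[0])
```

With this mapping the start tile "S" at (0, 0) is state 0 and a 4x4 map yields 16 observations, matching the text in the hunk.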
@@ -725,7 +725,6 @@
 "eval_seed = [] # The evaluation seed of the environment\n",
 "\n",
 "# Exploration parameters\n",
-"epsilon = 1.0 # Exploration rate\n",
 "max_epsilon = 1.0 # Exploration probability at start\n",
 "min_epsilon = 0.05 # Minimum exploration probability \n",
 "decay_rate = 0.0005 # Exponential decay rate for exploration prob"
@@ -1459,7 +1458,6 @@
 "gamma = 0.95 # Discounting rate\n",
 "\n",
 "# Exploration parameters\n",
-"epsilon = 1.0 # Exploration rate\n",
 "max_epsilon = 1.0 # Exploration probability at start\n",
 "min_epsilon = 0.05 # Minimum exploration probability \n",
 "decay_rate = 0.005 # Exponential decay rate for exploration prob\n"
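One plausible reading of why a standalone `epsilon = 1.0` is unneeded: if epsilon is recomputed at every episode from `max_epsilon`, `min_epsilon`, and `decay_rate`, its initial value is never used. The sketch below assumes the common exponential-decay schedule; this diff does not show the notebook's actual formula, and `epsilon_at` is a hypothetical helper name.

```python
import math


def epsilon_at(episode, max_epsilon=1.0, min_epsilon=0.05, decay_rate=0.0005):
    """Exploration probability for a given episode under exponential decay.

    Assumed schedule (not confirmed by the diff):
    min_epsilon + (max_epsilon - min_epsilon) * exp(-decay_rate * episode)
    """
    return min_epsilon + (max_epsilon - min_epsilon) * math.exp(-decay_rate * episode)


# Compare the two decay rates that appear in the hunks above.
for rate in (0.0005, 0.005):
    schedule = [epsilon_at(ep, decay_rate=rate) for ep in (0, 1000, 10000)]
    print(rate, [round(eps, 3) for eps in schedule])
```

Under this assumption epsilon starts at `max_epsilon` for episode 0 and approaches `min_epsilon` as training proceeds, with the larger `decay_rate` reaching the floor sooner.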