mirror of
https://github.com/huggingface/deep-rl-class.git
synced 2026-04-05 03:28:05 +08:00
Update colab (PPO link)
This commit is contained in:
@@ -7,7 +7,7 @@
|
||||
"provenance": [],
|
||||
"collapsed_sections": [],
|
||||
"private_outputs": true,
|
||||
"authorship_tag": "ABX9TyNPB+iXGKgIWKts27HKZacW",
|
||||
"authorship_tag": "ABX9TyNhsp4tLJxwa+BsFeBThLb7",
|
||||
"include_colab_link": true
|
||||
},
|
||||
"kernelspec": {
|
||||
@@ -232,7 +232,7 @@
|
||||
"source": [
|
||||
"## Step 4: Create the PPO Model 🤖\n",
|
||||
"\n",
|
||||
"PPO is one of the SOTA (state of the art) Deep Reinforcement Learning algorithms. If you don't know how it works, you can check this blogpost and the paper\n",
|
||||
"PPO is one of the SOTA (state of the art) Deep Reinforcement Learning algorithms. \n",
|
||||
"\n",
|
||||
"In this case, because we have a vector as input, we'll use an MLP (multi-layer perceptron) as policy.\n",
|
||||
"\n",
|
||||
|
||||
Reference in New Issue
Block a user