mirror of
https://github.com/huggingface/deep-rl-class.git
synced 2026-04-09 05:40:29 +08:00
Fix typo: update self-play.mdx
This commit is contained in:
@@ -24,7 +24,7 @@ This solution is called *self-play*. In self-play, **the agent uses former copie
|
||||
It’s the same way humans learn in competition:
|
||||
|
||||
- We start to train against an opponent of similar level
|
||||
- Then we learn from it, and when we acquired some skills, we can move further with stronger opponents.
|
||||
- Then we learn from it, and when we acquire some skills, we can move further with stronger opponents.
|
||||
|
||||
We do the same with self-play:
|
||||
|
||||
|
||||
Reference in New Issue
Block a user