openmlsys-zh/chapter_reinforcement_learning/summary.md at main

mirror of https://github.com/openmlsys/openmlsys-zh.git synced 2026-02-03 10:23:55 +08:00

Files

Zihan Ding 4ca2b6f698 chapter RL modified according to publisher feedback (#434 )

* fix chap12 render

* add distributed rl chapter

* fix bug

* fix issue #212

* fix typo

* update imgs

* fix chinese

* fix svg img

* update contents in rl chapter

* update marl sys

* fix a fig

* fix ref

* fix error

* update rl chapter according to chinese modification

* Revert "update rl chapter according to chinese modification"

This reverts commit 413f5b679f.

* update rl chapter with chinese modification

---------

Co-authored-by: Dalong <39682259+eedalong@users.noreply.github.com>

2023-03-26 08:21:38 +08:00

861 B

Raw Permalink Blame History

小结

在这一章，我们简单介绍了强化学习的基本概念，包括单智能体和多智能体强化学习算法、单节点和分布式强化学习系统等，给读者对强化学习问题的基本认识。当前，强化学习是一个快速发展的深度学习分支，许多实际问题都有可能通过强化学习算法的进一步发展得到解决。另一方面，由于强化学习问题设置的特殊性（如需要与环境交互进行采样等），也使得相应算法对计算系统的要求更高：如何更好地平衡样本采集和策略训练过程？如何均衡 CPU 和 GPU 等不同计算硬件的能力？如何在大规模分布式系统上有效部署强化学习智能体？都需要对计算机系统的设计和使用有更好的理解。

参考文献

:bibliography:../references/reinforcement.bib

861 B Raw Permalink Blame History

小结

参考文献

861 B

Raw Permalink Blame History