Files
openmlsys-zh/README_EN.md
Yeqi Huang 9c289782cc refactor: split CONTRIBUTING docs into separate CN/EN files (#503)
Split each bilingual CONTRIBUTING file into _en.md and _zh.md variants:
- CONTRIBUTING.md → CONTRIBUTING_en.md + CONTRIBUTING_zh.md
- style.md → style_en.md + style_zh.md
- info.md → info_en.md + info_zh.md
- issue.md → issue_en.md + issue_zh.md
- reference_guide.md → reference_guide_en.md + reference_guide_zh.md

terminology.md kept as-is (bilingual lookup table by nature).

Update README.md to link to _zh.md files, README_EN.md to _en.md files.

Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-15 21:51:18 +00:00

145 lines
5.4 KiB
Markdown

<p align="center">
<img src="info/logo-with-text.png" alt="OpenMLSys Logo" width="400"/>
</p>
<p align="center">
<a href="https://github.com/openmlsys/openmlsys-zh/actions/workflows/main.yml">
<img src="https://github.com/openmlsys/openmlsys-zh/actions/workflows/main.yml/badge.svg" alt="CI"/>
</a>
<a href="https://openmlsys.github.io/">
<img src="https://img.shields.io/badge/book-online-blue" alt="Book Online"/>
</a>
<a href="https://github.com/openmlsys/openmlsys-zh/blob/main/LICENSE">
<img src="https://img.shields.io/github/license/openmlsys/openmlsys-zh" alt="License"/>
</a>
<a href="https://github.com/openmlsys/openmlsys-zh/stargazers">
<img src="https://img.shields.io/github/stars/openmlsys/openmlsys-zh?style=social" alt="GitHub Stars"/>
</a>
</p>
<p align="center">
<a href="README.md">中文</a> | <b>English</b>
</p>
---
# Machine Learning Systems: Design and Implementation
An open-source book explaining the design principles and implementation experience of modern machine learning systems, covering the complete technology stack from programming interfaces and computational graphs to compilers and distributed training.
**English version 1 (stable):** [openmlsys.github.io/html-en/](https://openmlsys.github.io/html-en/)
**English version 2:** Under reconstruction.
## Table of Contents
- [Target Audience](#target-audience)
- [Content Overview](#content-overview)
- [Build Guide](#build-guide)
- [Contributing](#contributing)
- [Community](#community)
- [License](#license)
## Target Audience
- **Students**: Those who have mastered machine learning fundamentals and want to deeply understand the design and implementation of modern ML systems.
- **Researchers**: Those who need to develop custom operators or leverage distributed execution for large model development.
- **Engineers**: Those responsible for building ML infrastructure and need to tune system performance or customize ML systems for business needs.
## Content Overview
The book (2nd edition) consists of 9 chapters:
| Chapter | Content |
|---------|---------|
| [Chapter 1: Introduction](v2/en_chapters/01_chapter_introduction/) | Overview of ML system architecture and technology stack |
| [Chapter 2: Programming Interfaces and Computational Graphs](v2/en_chapters/02_chapter_programming_and_graph/) | Tensor abstraction, automatic differentiation, graph representation and execution |
| [Chapter 3: AI Accelerators and Programming](v2/en_chapters/03_chapter_accelerator/) | GPU architecture and CUDA/Triton/CUTLASS programming models |
| [Chapter 4: AI Compilers and Runtime Systems](v2/en_chapters/04_chapter_compiler_and_runtime/) | IR design, graph optimization, kernel generation, and runtime execution |
| [Chapter 5: Data Processing Systems](v2/en_chapters/05_chapter_data_processing/) | Data loading, data pipelines, and distributed data processing |
| [Chapter 6: Training Systems](v2/en_chapters/06_chapter_training_systems/) | Single-node and distributed training, parallelism strategies, and training optimization |
| [Chapter 7: Model Serving](v2/en_chapters/07_chapter_model_serving/) | Inference optimization, online serving, and model management |
| [Chapter 8: RL Systems](v2/en_chapters/08_chapter_rl_systems/) | Reinforcement learning pipelines, environment interaction, and RL system design |
| [Chapter 9: Large-scale GPU Cluster Management](v2/en_chapters/09_chapter_gpu_cluster/) | GPU scheduling, resource management, and large-scale training infrastructure |
## Changelog
| Date | Event |
|------|-------|
| 2022-01 | Project initialized; Chinese content writing begins |
| 2022-05 | Extension chapters released (Federated Learning, RL Systems, Explainable AI) |
| 2023-05 | Codebase adapted to MindSpore 2.0 |
| 2026-03 | Bilingual (CN/EN) build architecture refactored; English version launched |
## Build Guide
### Prerequisites
- curl
- git
- Python 3
### Installation
```bash
# Clone the repository
git clone https://github.com/openmlsys/openmlsys-zh.git
cd openmlsys-zh
# Install Rust toolchain (Linux/macOS)
curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh
# Install mdbook
cargo install mdbook
```
### Build HTML
```bash
sh build_mdbook_v2.sh
# English output: .mdbook-v2/book
# Chinese output: .mdbook-v2-zh/book
```
For more details, see the [Build Guide](CONTRIBUTING/info_en.md).
## Contributing
We welcome all forms of contributions. For the full workflow, see the **[Contributing Guide](CONTRIBUTING/CONTRIBUTING_en.md)**.
Before contributing, please read:
- [Writing Style Guide](CONTRIBUTING/style_en.md)
- [Terminology Guide](CONTRIBUTING/terminology.md)
## Community
<p align="center">
<img src="info/mlsys_group.png" alt="微信群二维码" width="200"/>
<br/>
Join our WeChat group by scanning the QR code
</p>
## Citation
If this book has been helpful to your research or work, please cite it as:
**Plain text:**
> OpenMLSys Team. *Machine Learning Systems: Design and Implementation*. 2022. https://openmlsys.github.io/
**BibTeX:**
```bibtex
@book{openmlsys2022,
title = {Machine Learning Systems: Design and Implementation},
author = {OpenMLSys Team},
year = {2022},
url = {https://openmlsys.github.io/},
note = {Open-source textbook, \url{https://github.com/openmlsys/openmlsys-zh}}
}
```
## License
This project is licensed under the [Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License](https://creativecommons.org/licenses/by-nc-sa/4.0/).