## 引用 \[1\] Meijer, E., Beckman, B., & Bierman, G. (2006, June). Linq: reconciling object, relations and xml in the. net framework. In Proceedings of the 2006 ACM SIGMOD international conference on Management of data (pp. 706-706). \[2\] Murray, D. G., McSherry, F., Isaacs, R., Isard, M., Barham, P., & Abadi, M. (2013, November). Naiad: a timely dataflow system. In Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles (pp. 439-455). \[3\] Zaharia, M., Chowdhury, M., Franklin, M. J., Shenker, S., & Stoica, I. (2010). Spark: Cluster computing with working sets. HotCloud, 10(10-10), 95. \[4\] Fetterly, Y. Y. M. I. D., Budiu, M., Erlingsson, Ú., & Currey, P. K. G. J. (2009). DryadLINQ: A system for general-purpose distributed data-parallel computing using a high-level language. Proc. LSDS-IR, 8. \[5\] Murray, D. G., Simsa, J., Klimovic, A., & Indyk, I. (2021). tf. data: A Machine Learning Data Processing Framework. arXiv preprint arXiv:2101.12127. \[6\] Mohan, J., Phanishayee, A., Raniwala, A., & Chidambaram, V. (2020). Analyzing and mitigating data stalls in DNN training. arXiv preprint arXiv:2007.06775. \[7\] https://docs.google.com/document/d/18CXhDb1ygxg-YXNBJNzfzZsDFosB5e6BfnXLlejd9l0/edit#. \[8\] https://github.com/NVIDIA/DALI. \[9\] https://docs.ray.io/en/latest/data/dataset.html. \[10\] https://gitee.com/mindspore/dataset-plugin.