openmlsys-zh/chapter_data_processing/reference.md
2022-03-03 08:54:20 +00:00

References

[1] Meijer, E., Beckman, B., & Bierman, G. (2006, June). LINQ: Reconciling object, relations and XML in the .NET framework. In Proceedings of the 2006 ACM SIGMOD International Conference on Management of Data (p. 706).

[2] Murray, D. G., McSherry, F., Isaacs, R., Isard, M., Barham, P., & Abadi, M. (2013, November). Naiad: a timely dataflow system. In Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles (pp. 439-455).

[3] Zaharia, M., Chowdhury, M., Franklin, M. J., Shenker, S., & Stoica, I. (2010). Spark: Cluster computing with working sets. In 2nd USENIX Workshop on Hot Topics in Cloud Computing (HotCloud '10).

[4] Yu, Y., Isard, M., Fetterly, D., Budiu, M., Erlingsson, Ú., Gunda, P. K., & Currey, J. (2008). DryadLINQ: A system for general-purpose distributed data-parallel computing using a high-level language. In Proceedings of the 8th USENIX Symposium on Operating Systems Design and Implementation (OSDI '08).

[5] Murray, D. G., Simsa, J., Klimovic, A., & Indyk, I. (2021). tf.data: A machine learning data processing framework. arXiv preprint arXiv:2101.12127.

[6] Mohan, J., Phanishayee, A., Raniwala, A., & Chidambaram, V. (2020). Analyzing and mitigating data stalls in DNN training. arXiv preprint arXiv:2007.06775.

[7] https://docs.google.com/document/d/18CXhDb1ygxg-YXNBJNzfzZsDFosB5e6BfnXLlejd9l0/edit#.

[8] https://github.com/NVIDIA/DALI.

[9] https://docs.ray.io/en/latest/data/dataset.html.

[10] https://gitee.com/mindspore/dataset-plugin.