Breaking the computation and communication abstraction barrier in distributed machine learning workloads A Jangda, J Huang, G Liu, AHN Sabet, S Maleki, Y Miao, M Musuvathi, ... Proceedings of the 27th ACM International Conference on Architectural …, 2022 | 27 | 2022 |
Superscaler: Supporting flexible dnn parallelization via a unified abstraction Z Lin, Y Miao, G Liu, X Shi, Q Zhang, F Yang, S Maleki, Y Zhu, X Cao, C Li, ... arXiv preprint arXiv:2301.08984, 2023 | 3 | 2023 |
SEER: A Time Prediction Model for CNNs from GPU Kernel's View G Liu, S Wang, Y Bao 2021 30th International Conference on Parallel Architectures and Compilation …, 2021 | 3 | 2021 |
Aceso: Efficient Parallel DNN Training through Iterative Bottleneck Alleviation G Liu, Y Miao, Z Lin, X Shi, S Maleki, F Yang, Y Bao, S Wang Proceedings of the Nineteenth European Conference on Computer Systems, 163-181, 2024 | | 2024 |