Melon: Breaking the memory wall for resource-efficient on-device machine learning | Q Wang, M Xu, C Jin, X Dong, J Yuan, X Jin, G Huang, Y Liu, X Liu | Proceedings of the 20th Annual International Conference on Mobile Systems …, 2022 | 58 | 2022 |
RAGCache: Efficient knowledge caching for retrieval-augmented generation | C Jin, Z Zhang, X Jiang, F Liu, X Liu, X Liu, X Jin | arXiv preprint arXiv:2404.12457, 2024 | 41 | 2024 |
Ditto: Efficient serverless analytics with elastic parallelism | C Jin, Z Zhang, X Xiang, S Zou, G Huang, X Liu, X Jin | Proceedings of the ACM SIGCOMM 2023 Conference, 406-419, 2023 | 18 | 2023 |
Fast, approximate vector queries on very large unstructured datasets | Z Zhang, C Jin, L Tang, X Liu, X Jin | 20th USENIX Symposium on Networked Systems Design and Implementation (NSDI …, 2023 | 4 | 2023 |
Jolteon: Unleashing the promise of serverless for serverless workflows | Z Zhang, C Jin, X Jin | 21st USENIX Symposium on Networked Systems Design and Implementation (NSDI …, 2024 | 1 | 2024 |
Towards Swift Serverless LLM Cold Starts with ParaServe | C Lou, S Qi, C Jin, D Nie, H Yang, X Liu, X Jin | arXiv preprint arXiv:2502.15524, 2025 | | 2025 |
Pyxis: Scheduling Mixed Tasks in Disaggregated Datacenters | S Qi, C Jin, M Chowdhury, Z Liu, X Liu, X Jin | IEEE Transactions on Parallel and Distributed Systems, 2024 | | 2024 |