Codebert: A pre-trained model for programming and natural languages Z Feng, D Guo, D Tang, N Duan, X Feng, M Gong, L Shou, B Qin, T Liu, ... arXiv preprint arXiv:2002.08155, 2020 | 2953 | 2020 |
Graphcodebert: Pre-training code representations with data flow D Guo, S Ren, S Lu, Z Feng, D Tang, S Liu, L Zhou, N Duan, ... arXiv preprint arXiv:2009.08366, 2020 | 1306* | 2020 |
Codexglue: A machine learning benchmark dataset for code understanding and generation S Lu, D Guo, S Ren, J Huang, A Svyatkovskiy, A Blanco, C Clement, ... arXiv preprint arXiv:2102.04664, 2021 | 1200* | 2021 |
Unixcoder: Unified cross-modal pre-training for code representation D Guo, S Lu, N Duan, Y Wang, M Zhou, J Yin arXiv preprint arXiv:2203.03850, 2022 | 599 | 2022 |
DeepSeek-Coder: When the Large Language Model Meets Programming--The Rise of Code Intelligence D Guo, Q Zhu, D Yang, Z Xie, K Dong, W Zhang, G Chen, X Bi, Y Wu, ... arXiv preprint arXiv:2401.14196, 2024 | 490 | 2024 |
Codebleu: a method for automatic evaluation of code synthesis S Ren, D Guo, S Lu, L Zhou, S Liu, D Tang, N Sundaresan, M Zhou, ... arXiv preprint arXiv:2009.10297, 2020 | 462 | 2020 |
Baize: An open-source chat model with parameter-efficient tuning on self-chat data C Xu, D Guo, N Duan, J McAuley arXiv preprint arXiv:2304.01196, 2023 | 296 | 2023 |
Graph-based reasoning over heterogeneous external knowledge for commonsense question answering S Lv, D Guo, J Xu, D Tang, N Duan, M Gong, L Shou, D Jiang, G Cao, ... Proceedings of the AAAI conference on artificial intelligence 34 (05), 8449-8456, 2020 | 223 | 2020 |
Deepseekmath: Pushing the limits of mathematical reasoning in open language models Z Shao, P Wang, Q Zhu, R Xu, J Song, X Bi, H Zhang, M Zhang, YK Li, ... arXiv preprint arXiv:2402.03300, 2024 | 208 | 2024 |
Automating code review activities by large-scale pre-training Z Li, S Lu, D Guo, N Duan, S Jannu, G Jenks, D Majumder, J Green, ... Proceedings of the 30th ACM Joint European Software Engineering Conference …, 2022 | 207* | 2022 |
Dialog-to-action: Conversational question answering over a large-scale knowledge base D Guo, D Tang, N Duan, M Zhou, J Yin Advances in neural information processing systems 31, 2018 | 148 | 2018 |
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence Q Zhu, D Guo, Z Shao, D Yang, P Wang, R Xu, Y Wu, Y Li, H Gao, S Ma, ... arXiv preprint arXiv:2406.11931, 2024 | 139* | 2024 |
Reacc: A retrieval-augmented code completion framework S Lu, N Duan, H Han, D Guo, S Hwang, A Svyatkovskiy arXiv preprint arXiv:2203.07722, 2022 | 120 | 2022 |
Deepseek-v2: A strong, economical, and efficient mixture-of-experts language model A Liu, B Feng, B Wang, B Wang, B Liu, C Zhao, C Dengr, C Ruan, D Dai, ... arXiv preprint arXiv:2405.04434, 2024 | 117 | 2024 |
Multi-task learning for conversational question answering over a large-scale knowledge base T Shen, X Geng, T Qin, D Guo, D Tang, N Duan, G Long, D Jiang arXiv preprint arXiv:1910.05069, 2019 | 101 | 2019 |
CodeBERT: a pre-trained model for programming and natural languages (2020) Z Feng, D Guo, D Tang, N Duan, X Feng, M Gong, L Shou, B Qin, T Liu, ... arXiv preprint arXiv:2002.08155, 2002 | 83 | 2002 |
Deepseek llm: Scaling open-source language models with longtermism X Bi, D Chen, G Chen, S Chen, D Dai, C Deng, H Ding, K Dong, Q Du, ... arXiv preprint arXiv:2401.02954, 2024 | 78 | 2024 |
Deepseek-r1: Incentivizing reasoning capability in llms via reinforcement learning D Guo, D Yang, H Zhang, J Song, R Zhang, R Xu, Q Zhu, S Ma, P Wang, ... arXiv preprint arXiv:2501.12948, 2025 | 64 | 2025 |
Deepseek-v3 technical report A Liu, B Feng, B Xue, B Wang, B Wu, C Lu, C Zhao, C Deng, C Zhang, ... arXiv preprint arXiv:2412.19437, 2024 | 63 | 2024 |
Longcoder: A long-range pre-trained language model for code completion D Guo, C Xu, N Duan, J Yin, J McAuley International Conference on Machine Learning, 12098-12107, 2023 | 63 | 2023 |