Efficient resource allocation with fairness constraints in restless multi-armed bandits D Li, P Varakantham Uncertainty in Artificial Intelligence, 1158-1167, 2022 | 13 | 2022 |
CLAIM: Curriculum learning policy for influence maximization in unknown social networks D Li, M Lowalekar, P Varakantham Uncertainty in Artificial Intelligence, 1455-1465, 2021 | 10 | 2021 |
Toolace: Winning the points of llm function calling W Liu, X Huang, X Zeng, X Hao, S Yu, D Li, S Wang, W Gan, Z Liu, Y Yu, ... arXiv preprint arXiv:2409.00920, 2024 | 7 | 2024 |
Towards soft fairness in restless multi-armed bandits D Li, P Varakantham arXiv preprint arXiv:2207.13343, 2022 | 7 | 2022 |
Aligning crowd feedback via distributional preference reward modeling D Li, C Zhang, K Dong, DGX Deik, R Tang, Y Liu arXiv preprint arXiv:2402.09764, 2024 | 6 | 2024 |
Effective diversity in unsupervised environment design W Li, P Varakantham, D Li arXiv preprint arXiv:2301.08025, 2023 | 6 | 2023 |
Diversity induced environment design via self-play D Li, W Li, P Varakantham arXiv preprint arXiv:2302.02119, 2023 | 5 | 2023 |
Avoiding starvation of arms in restless multi-armed bandit D Li, P Varakantham International Foundation for Autonomous Agents and Multiagent Systems, 2023 | 4 | 2023 |
Meta-task planning for language agents C Zhang, DGX Deik, D Li, H Zhang, Y Liu arXiv preprint arXiv:2405.16510, 2024 | 2 | 2024 |
Generalization through diversity: improving unsupervised environment design W Li, P Varakantham, D Li arXiv preprint arXiv:2301.08025, 2023 | 2 | 2023 |
Enhancing the hierarchical environment design via generative trajectory modeling D Li, P Varakantham | 1 | 2024 |
MMDocIR: Benchmarking Multi-Modal Retrieval for Long Documents K Dong, Y Chang, XD Goh, D Li, R Tang, Y Liu arXiv preprint arXiv:2501.08828, 2025 | | 2025 |
Planning with Multi-Constraints via Collaborative Language Agents C Zhang, XD Goh, D Li, H Zhang, Y Liu Proceedings of the 31st International Conference on Computational …, 2025 | | 2025 |
EduQate: Generating Adaptive Curricula through RMABs in Education Settings S Tio, D Li, P Varakantham arXiv preprint arXiv:2406.14122, 2024 | | 2024 |
Sequential decision learning for social good and fairness D LI Singapore Management University, 2024 | | 2024 |
A Hierarchical Approach to Environment Design with Generative Trajectory Modeling D Li, P Varakantham arXiv preprint arXiv:2310.00301, 2023 | | 2023 |
Hidden State Approximation in Recurrent Neural Networks Using Continuous Particle Filtering D Li arXiv preprint arXiv:2212.09008, 2022 | | 2022 |
Marginal Benefit Induced Unsupervised Environment Design D Li, W Li, P Varakantham | | |