Follow
John Yang
Title
Cited by
Cited by
Year
Webshop: Towards scalable real-world web interaction with grounded language agents
S Yao, H Chen, J Yang, K Narasimhan
NeurIPS 2022, 2022
1742022
SWE-bench: Can Language Models Resolve Real-World GitHub Issues?
CE Jimenez, J Yang, A Wettig, S Yao, K Pei, O Press, K Narasimhan
ICLR 2024, 2023
862023
InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback
J Yang, A Prabhakar, K Narasimhan, S Yao
NeurIPS 2023 (Datasets & Benchmarks), 2023
382023
SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering
J Yang, CE Jimenez, A Wettig, K Lieret, S Yao, K Narasimhan, O Press
arXiv preprint arXiv:2405.15793, 2024
82024
Language Agents as Hackers: Evaluating Cybersecurity Skills with Capture the Flag
J Yang, A Prabhakar, S Yao, K Pei, KR Narasimhan
Multi-Agent Security Workshop @ NeurIPS 2023, 2023
42023
Quartz: A framework for engineering secure smart contracts
J Kolb, J Yang, RH Katz, DE Culler
EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS …, 2020
32020
Devbench: A comprehensive benchmark for software development
B Li, W Wu, Z Tang, L Shi, J Yang, J Li, S Yao, C Qian, B Hui, Q Zhang, ...
arXiv preprint arXiv:2403.08604, 2024
22024
Referral Augmentation for Zero-Shot Information Retrieval
M Tang, S Yao, J Yang, K Narasimhan
ACL 2024 (Findings), 2023
2023
Learning Language through Interactions with the Digital World
JB Yang
Princeton University, 2023
2023
Towards an Enhanced, Faithful, and Adaptable Web Interaction Environment
J Yang, H Chen, KR Narasimhan
Second Workshop on Language and Reinforcement Learning @ NeurIPS 2022, 2022
2022
The system can't perform the operation now. Try again later.
Articles 1–10