Active example selection for in-context learning Y Zhang, S Feng, C Tan arXiv preprint arXiv:2211.04486, 2022 | 135 | 2022 |
Effective Prompt Extraction from Language Models Y Zhang, N Carlini, D Ippolito arXiv preprint arXiv:2307.06865, 2023 | 42* | 2023 |
Conversations gone alright: Quantifying and predicting prosocial outcomes in online conversations J Bao, J Wu, Y Zhang, E Chandrasekharan, D Jurgens Proceedings of the Web Conference 2021, 1134-1145, 2021 | 38 | 2021 |
Selective explanations: Leveraging human input to align explainable ai V Lai, Y Zhang, C Chen, QV Liao, C Tan Proceedings of the ACM on Human-Computer Interaction 7 (CSCW2), 1-35, 2023 | 30 | 2023 |
Flame: Few-shot learning from natural language explanations Y Zhou, Y Zhang, C Tan arXiv preprint arXiv:2306.08042, 2023 | 10 | 2023 |
Biasx:" thinking slow" in toxic content moderation with explanations of implied social biases Y Zhang, S Nanduri, L Jiang, T Wu, M Sap arXiv preprint arXiv:2305.13589, 2023 | 5 | 2023 |
Learning to Ignore Adversarial Attacks Y Zhang, Y Zhou, S Carton, C Tan arXiv preprint arXiv:2205.11551, 2022 | 4 | 2022 |
OpenHEXAI: An Open-Source Framework for Human-Centered Evaluation of Explainable Machine Learning J Ma, V Lai, Y Zhang, C Chen, P Hamilton, D Ljubenkov, H Lakkaraju, ... arXiv preprint arXiv:2403.05565, 2024 | 3 | 2024 |
Backtracking improves generation safety Y Zhang, J Chi, H Nguyen, K Upasani, DM Bikel, J Weston, EM Smith arXiv preprint arXiv:2409.14586, 2024 | 1 | 2024 |
Forcing Diffuse Distributions out of Language Models Y Zhang, A Schwarzschild, N Carlini, Z Kolter, D Ippolito arXiv preprint arXiv:2404.10859, 2024 | 1 | 2024 |
Building a Flexible Knowledge Graph to Capture Real-World Events. L Burdick, O Ignat, Y Zhang, R Mihalcea, M Wang, S Wilson, Y Wei, ... TAC, 2019 | 1 | 2019 |
Human-aligned Chess with a Bit of Search Y Zhang, AP Jacob, V Lai, D Fried, D Ippolito arXiv preprint arXiv:2410.03893, 2024 | | 2024 |