Jing Yu Koh
Meta
Verified email at meta.com
Title
Cited by
Year
Scaling autoregressive models for content-rich text-to-image generation
J Yu, Y Xu, JY Koh, T Luong, G Baid, Z Wang, V Vasudevan, A Ku, Y Yang, ...
TMLR, 2022
Cited by: 1109; Year: 2022
Vector-quantized image modeling with improved VQGAN
J Yu, X Li, JY Koh, H Zhang, R Pang, J Qin, A Ku, Y Xu, J Baldridge, Y Wu
ICLR, 2021
Cited by: 448; Year: 2021
Cross-Modal Contrastive Learning for Text-to-Image Generation
H Zhang*, JY Koh*, J Baldridge, H Lee, Y Yang
CVPR, 2021
Cited by: 405; Year: 2021
Generating images with multimodal language models
JY Koh, D Fried, R Salakhutdinov
NeurIPS, 2023
Cited by: 233; Year: 2023
Grounding Language Models to Images for Multimodal Inputs and Outputs
JY Koh, R Salakhutdinov, D Fried
ICML, 2023
Cited by: 187; Year: 2023
VisualWebArena: Evaluating multimodal agents on realistic visual web tasks
JY Koh, R Lo, L Jang, V Duvvur, MC Lim, PY Huang, G Neubig, S Zhou, ...
ACL, 2024
Cited by: 120; Year: 2024
Pathdreamer: A World Model for Indoor Navigation
JY Koh, H Lee, Y Yang, J Baldridge, P Anderson
ICCV, 2021
Cited by: 77; Year: 2021
Text-to-image generation grounded by fine-grained user attention
JY Koh, J Baldridge, H Lee, Y Yang
WACV, 2021
Cited by: 76; Year: 2021
A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning
A Kamath, P Anderson, S Wang, JY Koh, A Ku, A Waters, Y Yang, ...
CVPR, 2022
Cited by: 49*; Year: 2022
OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web
R Kapoor, YP Butala, M Russak, JY Koh, K Kamble, W Alshikh, ...
ECCV, 2024
Cited by: 39*; Year: 2024
Revisiting hierarchical approach for persistent long-term video prediction
W Lee, W Jung, H Zhang, T Chen, JY Koh, T Huang, H Yoon, H Lee, ...
ICLR, 2021
Cited by: 29; Year: 2021
VQ3D: Learning a 3D-aware generative model on ImageNet
K Sargent, JY Koh, H Zhang, H Chang, C Herrmann, P Srinivasan, J Wu, ...
ICCV, 2023
Cited by: 28; Year: 2023
Simple and Effective Synthesis of Indoor 3D Scenes
JY Koh*, H Agrawal*, D Batra, R Tucker, A Waters, H Lee, Y Yang, ...
AAAI, 2022
Cited by: 27; Year: 2022
Improving Customer Satisfaction in Bike Sharing Systems through Dynamic Repositioning
S Ghosh*, JY Koh*, P Jaillet
IJCAI, 2019
Cited by: 27; Year: 2019
Tree Search for Language Model Agents
JY Koh, S McAleer, D Fried, R Salakhutdinov
arXiv preprint arXiv:2407.01476, 2024
Cited by: 24; Year: 2024
Urban Zoning Using Higher-Order Markov Random Fields on Multi-View Imagery Data
T Feng, QT Truong, T Nguyen, JY Koh, LF Yu, SK Yeung, A Binder
ECCV, 2018
Cited by: 22; Year: 2018
Twitter-informed crowd flow prediction
G Goh, JY Koh, Y Zhang
ICDM Workshops, 2018
Cited by: 17; Year: 2018
Dissecting Adversarial Robustness of Multimodal LM Agents
CH Wu, RR Shah, JY Koh, R Salakhutdinov, D Fried, A Raghunathan
NeurIPS 2024 Workshop on Open-World Agents, 2024
Cited by: 16*; Year: 2024
Multimodal graph learning for generative tasks
M Yoon, JY Koh, B Hooi, R Salakhutdinov
arXiv preprint arXiv:2310.07478, 2023
Cited by: 8; Year: 2023
Systems And Methods For Generating Predicted Visual Observations Of An Environment Using Machine Learned Models
JY Koh, H Lee, Y Yang, JM Baldridge, PJ Anderson
US Patent App. 17/409,249, 2023
Cited by: 6; Year: 2023