Vltint: Visual-linguistic transformer-in-transformer for coherent video paragraph captioning K Yamazaki, K Vo, QS Truong, B Raj, N Le Proceedings of the AAAI Conference on Artificial intelligence 37 (3), 3081-3090, 2023 | 40 | 2023 |
Multi-module recurrent convolutional neural network with transformer encoder for ECG arrhythmia classification MD Le, VS Rathour, QS Truong, Q Mai, P Brijesh, N Le 2021 IEEE EMBS International Conference on Biomedical and Health Informatics …, 2021 | 35 | 2021 |
Vlcap: Vision-language with contrastive learning for coherent video paragraph captioning K Yamazaki, S Truong, K Vo, M Kidd, C Rainwater, K Luu, N Le 2022 IEEE International Conference on Image Processing (ICIP), 3656-3661, 2022 | 34 | 2022 |
Aoe-net: Entities interactions modeling with adaptive attention mechanism for temporal action proposals generation K Vo, S Truong, K Yamazaki, B Raj, MT Tran, N Le International Journal of Computer Vision 131 (1), 302-323, 2023 | 27 | 2023 |
scl-st: Supervised contrastive learning with semantic transformations for multiple lead ecg arrhythmia classification D Le, S Truong, P Brijesh, DA Adjeroh, N Le IEEE journal of biomedical and health informatics 27 (6), 2818-2828, 2023 | 26 | 2023 |
Aei: Actors-environment interaction with adaptive attention for temporal action proposals generation K Vo, H Joo, K Yamazaki, S Truong, K Kitani, MT Tran, N Le arXiv preprint arXiv:2110.11474, 2021 | 20 | 2021 |
Abn: Agent-aware boundary networks for temporal action proposal generation K Vo, K Yamazaki, S Truong, MT Tran, A Sugimoto, N Le IEEE Access 9, 126431-126445, 2021 | 17 | 2021 |
CarcassFormer: An End-to-end Transformer-based Framework for Simultaneous Localization, Segmentation and Classification of Poultry Carcass Defect M Tran, S Truong, AFA Fernandes, MT Kidd, N Le Poultry Science, 103765, 2024 | 1 | 2024 |
Towards Multi-modal Interpretable Video Understanding QS Truong University of Arkansas, 2023 | | 2023 |