A unified feature disentangler for multi-domain image translation and manipulation AH Liu, YC Liu, YY Yeh, YCF Wang Advances in neural information processing systems 31, 2018 | 411 | 2018 |
Towards scene understanding: Unsupervised monocular depth estimation with semantic-aware representation PY Chen, AH Liu, YC Liu, YCF Wang Proceedings of the IEEE/CVF Conference on computer vision and pattern …, 2019 | 271 | 2019 |
Contrastive audio-visual masked autoencoder Y Gong, A Rouditchenko, AH Liu, D Harwath, L Karlinsky, H Kuehne, ... arXiv preprint arXiv:2210.07839, 2022 | 126 | 2022 |
Listen, think, and understand Y Gong, H Luo, AH Liu, L Karlinsky, J Glass arXiv preprint arXiv:2305.10790, 2023 | 115 | 2023 |
Non-autoregressive predictive coding for learning speech representations from local dependencies AH Liu, YA Chung, J Glass arXiv preprint arXiv:2011.00406, 2020 | 100 | 2020 |
Towards end-to-end unsupervised speech recognition AH Liu, WN Hsu, M Auli, A Baevski 2022 IEEE Spoken Language Technology Workshop (SLT), 221-228, 2023 | 75 | 2023 |
Spoken moments: Learning joint audio-visual representations from video descriptions M Monfort, SY Jin, A Liu, D Harwath, R Feris, J Glass, A Oliva Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 69 | 2021 |
Parp: Prune, adjust and re-prune for self-supervised speech recognition CIJ Lai, Y Zhang, AH Liu, S Chang, YL Liao, YS Chuang, K Qian, ... Advances in Neural Information Processing Systems 34, 21256-21272, 2021 | 67 | 2021 |
Towards unsupervised speech recognition and synthesis with quantized speech representation learning AH Liu, T Tu, H Lee, L Lee ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 55 | 2020 |
Adversarial training of end-to-end speech recognition using a criticizing language model AH Liu, H Lee, L Lee ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 52 | 2019 |
Joint audio and speech understanding Y Gong, AH Liu, H Luo, L Karlinsky, J Glass 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023 | 45 | 2023 |
Cross-modal discrete representation learning AH Liu, SY Jin, CIJ Lai, A Rouditchenko, A Oliva, J Glass arXiv preprint arXiv:2106.05438, 2021 | 43 | 2021 |
Simple and effective unsupervised speech synthesis AH Liu, CIJ Lai, WN Hsu, M Auli, A Baevski, J Glass arXiv preprint arXiv:2204.02524, 2022 | 20 | 2022 |
Uavm: Towards unifying audio and visual models Y Gong, AH Liu, A Rouditchenko, J Glass IEEE Signal Processing Letters 29, 2437-2441, 2022 | 19 | 2022 |
Worse wer, but better bleu? leveraging word embedding as intermediate in multitask end-to-end speech translation SP Chuang, TW Sung, AH Liu, H Lee arXiv preprint arXiv:2005.10678, 2020 | 18 | 2020 |
Generative pre-training for speech with flow matching AH Liu, M Le, A Vyas, B Shi, A Tjandra, WN Hsu arXiv preprint arXiv:2310.16338, 2023 | 17 | 2023 |
Improving automatic speech recognition and speech translation via word embedding prediction SP Chuang, AH Liu, TW Sung, H Lee IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 93-105, 2020 | 17 | 2020 |
Self-supervised fine-tuning for improved content representations by speaker-invariant clustering HJ Chang, AH Liu, J Glass arXiv preprint arXiv:2305.11072, 2023 | 16 | 2023 |
Towards audio language modeling-an overview H Wu, X Chen, YC Lin, K Chang, HL Chung, AH Liu, H Lee arXiv preprint arXiv:2402.13236, 2024 | 15 | 2024 |
Dinosr: Self-distillation and online clustering for self-supervised speech representation learning AH Liu, HJ Chang, M Auli, WN Hsu, J Glass Advances in Neural Information Processing Systems 36, 2024 | 14 | 2024 |