Follow
Alexander H. Liu
Alexander H. Liu
Verified email at mit.edu - Homepage
Title
Cited by
Cited by
Year
A unified feature disentangler for multi-domain image translation and manipulation
AH Liu, YC Liu, YY Yeh, YCF Wang
Advances in neural information processing systems 31, 2018
4112018
Towards scene understanding: Unsupervised monocular depth estimation with semantic-aware representation
PY Chen, AH Liu, YC Liu, YCF Wang
Proceedings of the IEEE/CVF Conference on computer vision and pattern …, 2019
2712019
Contrastive audio-visual masked autoencoder
Y Gong, A Rouditchenko, AH Liu, D Harwath, L Karlinsky, H Kuehne, ...
arXiv preprint arXiv:2210.07839, 2022
1262022
Listen, think, and understand
Y Gong, H Luo, AH Liu, L Karlinsky, J Glass
arXiv preprint arXiv:2305.10790, 2023
1152023
Non-autoregressive predictive coding for learning speech representations from local dependencies
AH Liu, YA Chung, J Glass
arXiv preprint arXiv:2011.00406, 2020
1002020
Towards end-to-end unsupervised speech recognition
AH Liu, WN Hsu, M Auli, A Baevski
2022 IEEE Spoken Language Technology Workshop (SLT), 221-228, 2023
752023
Spoken moments: Learning joint audio-visual representations from video descriptions
M Monfort, SY Jin, A Liu, D Harwath, R Feris, J Glass, A Oliva
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
692021
Parp: Prune, adjust and re-prune for self-supervised speech recognition
CIJ Lai, Y Zhang, AH Liu, S Chang, YL Liao, YS Chuang, K Qian, ...
Advances in Neural Information Processing Systems 34, 21256-21272, 2021
672021
Towards unsupervised speech recognition and synthesis with quantized speech representation learning
AH Liu, T Tu, H Lee, L Lee
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
552020
Adversarial training of end-to-end speech recognition using a criticizing language model
AH Liu, H Lee, L Lee
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
522019
Joint audio and speech understanding
Y Gong, AH Liu, H Luo, L Karlinsky, J Glass
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
452023
Cross-modal discrete representation learning
AH Liu, SY Jin, CIJ Lai, A Rouditchenko, A Oliva, J Glass
arXiv preprint arXiv:2106.05438, 2021
432021
Simple and effective unsupervised speech synthesis
AH Liu, CIJ Lai, WN Hsu, M Auli, A Baevski, J Glass
arXiv preprint arXiv:2204.02524, 2022
202022
Uavm: Towards unifying audio and visual models
Y Gong, AH Liu, A Rouditchenko, J Glass
IEEE Signal Processing Letters 29, 2437-2441, 2022
192022
Worse wer, but better bleu? leveraging word embedding as intermediate in multitask end-to-end speech translation
SP Chuang, TW Sung, AH Liu, H Lee
arXiv preprint arXiv:2005.10678, 2020
182020
Generative pre-training for speech with flow matching
AH Liu, M Le, A Vyas, B Shi, A Tjandra, WN Hsu
arXiv preprint arXiv:2310.16338, 2023
172023
Improving automatic speech recognition and speech translation via word embedding prediction
SP Chuang, AH Liu, TW Sung, H Lee
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 93-105, 2020
172020
Self-supervised fine-tuning for improved content representations by speaker-invariant clustering
HJ Chang, AH Liu, J Glass
arXiv preprint arXiv:2305.11072, 2023
162023
Towards audio language modeling-an overview
H Wu, X Chen, YC Lin, K Chang, HL Chung, AH Liu, H Lee
arXiv preprint arXiv:2402.13236, 2024
152024
Dinosr: Self-distillation and online clustering for self-supervised speech representation learning
AH Liu, HJ Chang, M Auli, WN Hsu, J Glass
Advances in Neural Information Processing Systems 36, 2024
142024
The system can't perform the operation now. Try again later.
Articles 1–20