Superb: Speech processing universal performance benchmark S Yang, PH Chi, YS Chuang, CIJ Lai, K Lakhotia, YY Lin, AT Liu, J Shi, ... arXiv preprint arXiv:2105.01051, 2021 | 907 | 2021 |
Fragmentvc: Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention YY Lin, CM Chien, JH Lin, H Lee, L Lee ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 84 | 2021 |
S2VC: A framework for any-to-any voice conversion with self-supervised pretrained representations J Lin, YY Lin, CM Chien, H Lee arXiv preprint arXiv:2104.02901, 2021 | 64 | 2021 |
Utilizing self-supervised representations for MOS prediction WC Tseng, C Huang, WT Kao, YY Lin, H Lee arXiv preprint arXiv:2104.03017, 2021 | 63 | 2021 |
Defending your voice: Adversarial attack on voice conversion C Huang, YY Lin, H Lee, L Lee 2021 IEEE Spoken Language Technology Workshop (SLT), 552-559, 2021 | 55 | 2021 |
A Large-Scale Evaluation of Speech Foundation Models S Yang, HJ Chang, Z Huang, AT Liu, CI Lai, H Wu, J Shi, X Chang, ... IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024 | 10 | 2024 |
Improving Frame-level Classifier for Word Timings with Non-peaky CTC in End-to-End Automatic Speech Recognition X Chen, YY Lin, K Wang, Y He, Z Ma arXiv preprint arXiv:2306.07949, 2023 | 4 | 2023 |
Random utterance concatenation based data augmentation for improving short-video speech recognition YY Lin, T Han, H Xu, VT Pham, Y Khassanov, TY Chong, Y He, L Lu, Z Ma arXiv preprint arXiv:2210.15876, 2022 | 2 | 2022 |
A Comprehensive Solution to Connect Speech Encoder and Large Language Model for ASR VT Pham, Y Lin, T Han, W Li, J Zhang, L Lu, Y Wang arXiv preprint arXiv:2406.17272, 2024 | | 2024 |