SUPERB: Speech processing Universal PERformance Benchmark S Yang, PH Chi, YS Chuang, CIJ Lai, K Lakhotia, YY Lin, AT Liu, J Shi, ... arXiv preprint arXiv:2105.01051, 2021 | 911 | 2021 |
Utilizing self-supervised representations for MOS prediction WC Tseng, C Huang, WT Kao, YY Lin, H Lee arXiv preprint arXiv:2104.03017, 2021 | 63 | 2021 |
SpeechPrompt: An exploration of prompt tuning on generative spoken language model for speech processing tasks KW Chang, WC Tseng, SW Li, H Lee arXiv preprint arXiv:2203.16773, 2022 | 53 | 2022 |
Offline multi-agent reinforcement learning with knowledge distillation WC Tseng, THJ Wang, YC Lin, P Isola Advances in Neural Information Processing Systems 35, 226-237, 2022 | 40 | 2022 |
SpeechPrompt v2: Prompt tuning for speech classification tasks KW Chang, YK Wang, H Shen, I Kang, WC Tseng, SW Li, H Lee arXiv preprint arXiv:2303.00733, 2023 | 32 | 2023 |
DDOS: A MOS prediction framework utilizing domain adaptive pre-training and distribution of opinion scores WC Tseng, WT Kao, H Lee arXiv preprint arXiv:2204.03219, 2022 | 20 | 2022 |
Ensemble knowledge distillation of self-supervised speech models KP Huang, TH Feng, YK Fu, TY Hsu, PC Yen, WC Tseng, KW Chang, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 17 | 2023 |
Membership inference attacks against self-supervised speech models WC Tseng, WT Kao, H Lee arXiv preprint arXiv:2111.05113, 2021 | 16 | 2021 |
A Large-Scale Evaluation of Speech Foundation Models S Yang, HJ Chang, Z Huang, AT Liu, CI Lai, H Wu, J Shi, X Chang, ... IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024 | 11 | 2024 |
Dynamic-SUPERB Phase-2: A collaboratively expanding benchmark for measuring the capabilities of spoken language models with 180 tasks C Huang, WC Chen, S Yang, AT Liu, CA Li, YX Lin, WC Tseng, A Diwan, ... arXiv preprint arXiv:2411.05361, 2024 | 1 | 2024 |
SpeechPrompt: Prompting speech language models for speech processing tasks KW Chang, H Wu, YK Wang, YK Wu, H Shen, WC Tseng, I Kang, SW Li, ... IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024 | 1 | 2024 |
Measuring Sound Symbolism in Audio-visual Models WC Tseng, YJ Shih, D Harwath, R Mooney arXiv preprint arXiv:2409.12306, 2024 | | 2024 |
VMCML: Video and Music Matching via Cross-Modality Lifting YS Lee, WC Tseng, FE Wang, M Sun Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | | 2024 |