Investigating Zero-Shot Generalizability on Mandarin-English Code-Switched ASR and Speech-to-text Translation of Recent Foundation Models with Self-Supervision and Weak Supervision CK Yang, KP Huang, KH Lu, CY Kuan, CY Hsiao, H Lee 2024 IEEE International Conference on Acoustics, Speech, and Signal …, 2023 | 10 | 2023 |
Zero resource code-switched speech benchmark using speech utterance pairs for multiple spoken languages KP Huang*, CK Yang*, YK Fu, E Dunbar, H Lee ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 7 | 2024 |
Dynamic-superb phase-2: A collaboratively expanding benchmark for measuring the capabilities of spoken language models with 180 tasks C Huang, WC Chen, S Yang, AT Liu, CA Li, YX Lin, WC Tseng, A Diwan, ... arXiv preprint arXiv:2411.05361, 2024 | 3 | 2024 |
Speech-Copilot: Leveraging Large Language Models for Speech Processing Via Task Decomposition, Modularization, and Program Generation CY Kuan*, CK Yang*, WP Huang, KH Lu, H Lee 2024 IEEE Spoken Language Technology Workshop (SLT), 1060-1067, 2024 | 2 | 2024 |
Do Prompts Really Prompt? Exploring the Prompt Understanding Capability of Whisper CK Yang, KP Huang, H Lee arXiv preprint arXiv:2406.05806, 2024 | 2 | 2024 |
Listen and Speak Fairly: a Study on Semantic Gender Bias in Speech Integrated Large Language Models YC Lin, TQ Lin*, CK Yang*, KH Lu*, WC Chen*, CY Kuan*, H Lee 2024 IEEE Spoken Language Technology Workshop (SLT), 439-446, 2024 | | 2024 |
Building a Taiwanese Mandarin Spoken Language Model: A First Attempt CK Yang, YK Fu, CA Li, YC Lin, YX Lin, WC Chen, HL Chung, CY Kuan, ... arXiv preprint arXiv:2411.07111, 2024 | | 2024 |