Dynamic-superb: Towards a dynamic, collaborative, and comprehensive instruction-tuning benchmark for speech C Huang, KH Lu, SH Wang, CY Hsiao, CY Kuan, H Wu, S Arora, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 28 | 2024 |
Investigating zero-shot generalizability on mandarin-english code-switched asr and speech-to-text translation of recent foundation models with self-supervision and weak supervision CK Yang, KP Huang, KH Lu, CY Kuan, CY Hsiao, H Lee 2024 IEEE International Conference on Acoustics, Speech, and Signal …, 2024 | 7 | 2024 |
Towards General-Purpose Text-Instruction-Guided Voice Conversion CY Kuan, CA Li, TY Hsu, TY Lin, HL Chung, KW Chang, SY Chang, H Lee 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023 | 6 | 2023 |
Speech-Copilot: Leveraging Large Language Models for Speech Processing via Task Decomposition, Modularization, and Program Generation CY Kuan, CK Yang, WP Huang, KH Lu, H Lee arXiv preprint arXiv:2407.09886, 2024 | 2 | 2024 |
Understanding Sounds, Missing the Questions: The Challenge of Object Hallucination in Large Audio-Language Models CY Kuan, WP Huang, H Lee arXiv preprint arXiv:2406.08402, 2024 | 2 | 2024 |
Dynamic-superb phase-2: A collaboratively expanding benchmark for measuring the capabilities of spoken language models with 180 tasks C Huang, WC Chen, S Yang, AT Liu, CA Li, YX Lin, WC Tseng, A Diwan, ... arXiv preprint arXiv:2411.05361, 2024 | 1 | 2024 |
Large Language Model as an Assignment Evaluator: Insights, Feedback, and Challenges in a 1000+ Student Course CH Chiang, WC Chen, CY Kuan, C Yang, H Lee arXiv preprint arXiv:2407.05216, 2024 | 1 | 2024 |
Building a Taiwanese Mandarin Spoken Language Model: A First Attempt CK Yang, YK Fu, CA Li, YC Lin, YX Lin, WC Chen, HL Chung, CY Kuan, ... arXiv preprint arXiv:2411.07111, 2024 | | 2024 |
Can Large Audio-Language Models Truly Hear? Tackling Hallucinations with Multi-Task Assessment and Stepwise Audio Reasoning CY Kuan, H Lee arXiv preprint arXiv:2410.16130, 2024 | | 2024 |
Listen and Speak Fairly: A Study on Semantic Gender Bias in Speech Integrated Large Language Models YC Lin, TQ Lin, CK Yang, KH Lu, WC Chen, CY Kuan, H Lee arXiv preprint arXiv:2407.06957, 2024 | | 2024 |