Dynamic-superb: Towards a dynamic, collaborative, and comprehensive instruction-tuning benchmark for speech C Huang, KH Lu, SH Wang, CY Hsiao, CY Kuan, H Wu, S Arora, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 27 | 2024 |
Non-autoregressive asr modeling using pre-trained language models for chinese speech recognition FH Yu, KY Chen, KH Lu IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1474-1482, 2022 | 26 | 2022 |
A context-aware knowledge transferring strategy for CTC-based ASR KH Lu, KY Chen 2022 IEEE Spoken Language Technology Workshop (SLT), 60-67, 2023 | 14 | 2023 |
Investigating zero-shot generalizability on mandarin-english code-switched asr and speech-to-text translation of recent foundation models with self-supervision and weak supervision CK Yang, KP Huang, KH Lu, CY Kuan, CY Hsiao, H Lee 2024 IEEE International Conference on Acoustics, Speech, and Signal …, 2024 | 6 | 2024 |
Desta: Enhancing speech language models through descriptive speech-text alignment KH Lu, Z Chen, SW Fu, H Huang, B Ginsburg, YCF Wang, H Lee INTERSPEECH, 2024 | 5 | 2024 |
Speech-Copilot: Leveraging Large Language Models for Speech Processing via Task Decomposition, Modularization, and Program Generation CY Kuan, CK Yang, WP Huang, KH Lu, H Lee arXiv preprint arXiv:2407.09886, 2024 | 2 | 2024 |
Hypr: A comprehensive study for ASR hypothesis revising with a reference corpus YW Wang, KH Lu, KY Chen arXiv preprint arXiv:2309.09838, 2023 | 2 | 2023 |
A transformer-based cross-modal fusion model with adversarial training for vqa challenge 2021 KH Lu, BH Fang, KY Chen arXiv preprint arXiv:2106.13033, 2021 | 2 | 2021 |
SpeechCaps: Advancing Instruction-Based Universal Speech Models with Multi-Talker Speaking Style Captioning C Huang, MH Shih, KH Lu, CY Hsiao, H Lee arXiv preprint arXiv:2408.13891, 2024 | 1 | 2024 |
Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data KH Lu, Z Chen, SW Fu, CHH Yang, J Balam, B Ginsburg, YCF Wang, ... arXiv preprint arXiv:2409.20007, 2024 | | 2024 |
Codec-SUPERB@ SLT 2024: A lightweight benchmark for neural audio codec models H Wu, X Chen, YC Lin, K Chang, J Du, KH Lu, AH Liu, HL Chung, YK Wu, ... arXiv preprint arXiv:2409.14085, 2024 | | 2024 |
Listen and Speak Fairly: A Study on Semantic Gender Bias in Speech Integrated Large Language Models YC Lin, TQ Lin, CK Yang, KH Lu, WC Chen, CY Kuan, H Lee arXiv preprint arXiv:2407.06957, 2024 | | 2024 |
ntust-nlp-2 at ROCLING-2021 Shared Task: BERT-based semantic analyzer with word-level information KH Lu, KY Chen Proceedings of the 33rd Conference on Computational Linguistics and Speech …, 2021 | | 2021 |
2020 福爾摩沙臺語語音辨識比賽之初步實驗 (A Preliminary Study of Formosa Speech Recognition Challenge 2020–Taiwanese ASR) FH Yu, KH Lu, YW Wang, WZ Chang, WK Huang, KY Chen International Journal of Computational Linguistics & Chinese Language …, 2021 | | 2021 |