追蹤
Wen-Chin Huang
Wen-Chin Huang
在 g.sp.m.is.nagoya-u.ac.jp 的電子郵件地址已通過驗證 - 首頁
標題
引用次數
引用次數
年份
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech
X Wang, J Yamagishi, M Todisco, H Delgado, A Nautsch, N Evans, ...
Computer Speech & Language 64, 101114, 2020
3822020
Mosnet: Deep learning based objective assessment for voice conversion
CC Lo, SW Fu, WC Huang, X Wang, J Yamagishi, Y Tsao, HM Wang
arXiv preprint arXiv:1904.08352, 2019
2932019
Voice Conversion Challenge 2020–-Intra-lingual semi-parallel and cross-lingual voice conversion–-}}
Z Yi, WC Huang, X Tian, J Yamagishi, RK Das, T Kinnunen, Z Ling, ...
Proc. Joint Workshop for the Blizzard Challenge and Voice Conversion …, 2020
231*2020
Generalization ability of MOS prediction networks
E Cooper, WC Huang, T Toda, J Yamagishi
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
1402022
The voicemos challenge 2022
WC Huang, E Cooper, Y Tsao, HM Wang, T Toda, J Yamagishi
arXiv preprint arXiv:2203.11389, 2022
1112022
Voice transformer network: Sequence-to-sequence voice conversion using transformer with text-to-speech pretraining
WC Huang, T Hayashi, YC Wu, H Kameoka, T Toda
arXiv preprint arXiv:1912.06813, 2019
1102019
SUPERB-SG: Enhanced speech processing universal performance benchmark for semantic and generative capabilities
HS Tsai, HJ Chang, WC Huang, Z Huang, K Lakhotia, S Yang, S Dong, ...
arXiv preprint arXiv:2203.06849, 2022
932022
Ldnet: Unified listener dependent modeling in mos prediction for synthetic speech
WC Huang, E Cooper, J Yamagishi, T Toda
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
682022
Predictions of subjective ratings and spoofing assessments of voice conversion challenge 2020 submissions
RK Das, T Kinnunen, WC Huang, Z Ling, J Yamagishi, Y Zhao, X Tian, ...
arXiv preprint arXiv:2009.03554, 2020
592020
The 2020 espnet update: new features, broadened applications, performance improvements, and future plans
S Watanabe, F Boyer, X Chang, P Guo, T Hayashi, Y Higuchi, T Hori, ...
2021 IEEE Data Science and Learning Workshop (DSLW), 1-6, 2021
552021
Voice conversion based on cross-domain features using variational auto encoders
WC Huang, HT Hwang, YH Peng, Y Tsao, HM Wang
2018 11th International Symposium on Chinese Spoken Language Processing …, 2018
532018
Unsupervised representation disentanglement using cross domain features and adversarial learning in variational autoencoder based voice conversion
WC Huang, H Luo, HT Hwang, CC Lo, YH Peng, Y Tsao, HM Wang
IEEE Transactions on Emerging Topics in Computational Intelligence 4 (4 …, 2020
502020
S3prl-vc: Open-source voice conversion framework with self-supervised speech representations
WC Huang, SW Yang, T Hayashi, HY Lee, S Watanabe, T Toda
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
462022
The singing voice conversion challenge 2023
WC Huang, LP Violeta, S Liu, J Shi, T Toda
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
452023
Pretraining Techniques for Sequence-to-Sequence Voice Conversion
WC Huang, T Hayashi, YC Wu, H Kameoka, T Toda
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 745 - 755, 2021
452021
The sequence-to-sequence baseline for the voice conversion challenge 2020: Cascading asr and tts
WC Huang, T Hayashi, S Watanabe, T Toda
arXiv preprint arXiv:2010.02434, 2020
452020
Any-to-one sequence-to-sequence voice conversion using self-supervised discrete speech representations
WC Huang, YC Wu, T Hayashi
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
372021
Many-to-many voice transformer network
H Kameoka, WC Huang, K Tanaka, T Kaneko, N Hojo, T Toda
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 656-670, 2020
372020
Speech recognition by simply fine-tuning BERT
WC Huang, CH Wu, SB Luo, KY Chen, HM Wang, T Toda
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
332021
Investigating self-supervised pretraining frameworks for pathological speech recognition
LP Violeta, WC Huang, T Toda
arXiv preprint arXiv:2203.15431, 2022
322022
系統目前無法執行作業,請稍後再試。
文章 1–20