Follow
Hsin-Min Wang
Hsin-Min Wang
Research Fellow/Professor, Institute of Information Sience, Academia Sinica
Verified email at iis.sinica.edu.tw - Homepage
Title
Cited by
Cited by
Year
Voice Conversion from Unaligned Corpora using Variational Autoencoding Wasserstein Generative Adversarial Networks
CC Hsu, HT Hwang, YC Wu, Y Tsao, HM Wang
Interspeech2017, 2017
4592017
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech
X Wang, J Yamagishi, M Todisco, H Delgado, A Nautsch, N Evans, ...
Computer Speech & Language 64, 101114, 2020
4042020
Voice conversion from non-parallel corpora using variational auto-encoder
CC Hsu, HT Hwang, YC Wu, Y Tsao, HM Wang
2016 Asia-Pacific Signal and Information Processing Association Annual …, 2016
3712016
MOSNet: Deep learning based objective assessment for voice conversion
CC Lo, SW Fu, WC Huang, X Wang, J Yamagishi, Y Tsao, HM Wang
Interspeech2019, 2019
3092019
Audio-visual speech enhancement using multimodal deep convolutional neural networks
JC Hou, SS Wang, YH Lai, Y Tsao, HW Chang, HM Wang
IEEE Transactions on Emerging Topics in Computational Intelligence 2 (2 …, 2018
2872018
A distributed architecture for cooperative spoken dialogue agents with coherent dialogue state and history
B Lin, H Wang, L Lee
IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU'99), 1999
2541999
Fluent speech prosody: Framework and modeling
C Tseng, SH Pin, Y Lee, HM Wang, YC Chen
Speech Communication 46 (3), 284-309, 2005
2262005
Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM
SW Fu, Y Tsao, HT Hwang, HM Wang
Interspeech2018, 2018
205*2018
MATBN: A Mandarin Chinese broadcast news corpus
HM Wang, B Chen, JW Kuo, SS Cheng
International Journal of Computational Linguistics and Chinese Language …, 2005
1522005
Frameworks for recognition of Mandarin syllables with tones using sub-syllabic units
CH Lin, CH Wu, PY Ting, HM Wang
Speech Communication 18 (2), 175-190, 1996
144*1996
Fast speaker adaptation using eigenspace-based maximum likelihood linear regression
KT Chen, WW Liau, HM Wang, LS Lee
ICSLP 2010, 742-745, 2000
1432000
Automatic singer recognition of popular music recordings via estimation and modeling of solo vocal signals
WH Tsai, HM Wang
IEEE Transactions on Audio, Speech, and Language Processing 14 (1), 330-341, 2005
1412005
Complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary using limited training data
H Wang, TH Ho, RC Yang, JL Shen, BR Bai, JC Hong, WP Chen, TL Yu, ...
IEEE Transactions on Speech and Audio Processing 5 (2), 195-200, 1997
1241997
Cost-sensitive multi-label learning for audio tag annotation and retrieval
HY Lo, JC Wang, HM Wang, SD Lin
IEEE Transactions on Multimedia 13 (3), 518-529, 2011
1222011
The voicemos challenge 2022
WC Huang, E Cooper, Y Tsao, HM Wang, T Toda, J Yamagishi
Interspeech2022, 2022
1202022
Golden Mandarin (II)-an improved single-chip real-time Mandarin dictation machine for Chinese language with very large vocabulary
LS Lee, CY Tseng, KJ Chen, IJ Hung, MY Lee, LF Chien, Y Lee, R Lyu, ...
1993 IEEE International Conference on Acoustics, Speech, and Signal …, 1993
1201993
Mandarin–English information (MEI): investigating translingual speech retrieval
HM Meng, B Chen, S Khudanpur, GA Levow, WK Lo, D Oard, P Schone, ...
Computer Speech & Language 18 (2), 163-179, 2004
922004
A chatbot using LSTM-based multi-layer embedding for elderly care
MH Su, CH Wu, KY Huang, QB Hong, HM Wang
2017 international conference on orange technologies (ICOT), 70-74, 2017
892017
Golden Mandarin (II)-an intelligent Mandarin dictation machine for Chinese character input with adaptation/learning functions
LS Lee, KJ Chen, CY Tseng, R Lyu, LF Chien, HM Wang, JL Shen, SC Lin, ...
International Conference on Speech, Image Processing and Neural Networks …, 1994
861994
Deep learning-based non-intrusive multi-objective speech assessment model with cross-domain features
RE Zezario, SW Fu, F Chen, CS Fuh, HM Wang, Y Tsao
IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 54-70, 2022
802022
The system can't perform the operation now. Try again later.
Articles 1–20