Speech emotion recognition using self-supervised features E Morais, R Hoory, W Zhu, I Gat, M Damasceno, H Aronowitz ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 133 | 2022 |
IBM MASTOR: Multilingual automatic speech-to-speech translator Y Gao, B Zhou, L Gu, R Sarikaya, HK Kuo, AVI Rosti, M Afify, W Zhu 2006 IEEE International Conference on Acoustics Speech and Signal Processing …, 2006 | 86 | 2006 |
Speaker normalization for self-supervised speech emotion recognition I Gat, H Aronowitz, W Zhu, E Morais, R Hoory ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 58 | 2022 |
Online Speaker Diarization using Adapted I-Vector Transforms W Zhu, J Pelecanos ICASSP 2016, 2016 | 53 | 2016 |
Nearest neighbor discriminant analysis for robust speaker recognition. SO Sadjadi, JW Pelecanos, W Zhu INTERSPEECH, 1860-1864, 2014 | 37 | 2014 |
Log-energy dynamic range normalization for robust speech recognition W Zhu, D O'Shaughnessy Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech …, 2005 | 32 | 2005 |
Incorporating frequency masking filtering in a standard MFCC feature extraction algorithm W Zhu, D O'Shaughnessy Proceedings 7th International Conference on Signal Processing, 2004 …, 2004 | 28 | 2004 |
New Advances in Speaker Diarization. H Aronowitz, W Zhu, M Suzuki, G Kurata, R Hoory Interspeech, 279-283, 2020 | 21 | 2020 |
Using noise reduction and spectral emphasis techniques to improve ASR performance in noisy conditions W Zhu, D O'Shaughnessy 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE …, 2003 | 14 | 2003 |
Context and uncertainty modeling for online speaker change detection H Aronowitz, W Zhu ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 12 | 2020 |
Role modeling in call centers and work centers KW Church, JW Pelecanos, J Vopicka, W Zhu US Patent 10,147,438, 2018 | 10 | 2018 |
A Bayesian attention neural network layer for speaker recognition W Zhu, J Pelecanos ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 9 | 2019 |
Nearest neighbor based i-vector normalization for robust speaker recognition under unseen channel conditions W Zhu, SO Sadjadi, JW Pelecanos 2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015 | 7 | 2015 |
The IBM RATS phase II speaker recognition system: overview and analysis. W Zhu, S Yaman, JW Pelecanos Interspeech, 3137-3141, 2013 | 5 | 2013 |
Unifying PLDA and polynomial kernel SVMs S Yaman, J Pelecanos, W Zhu 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013 | 5 | 2013 |
Forensically inspired approaches to automatic speaker recognition KJ Han, MK Omar, J Pelecanos, C Pendus, S Yaman, W Zhu 2011 IEEE International Conference on Acoustics, Speech and Signal …, 2011 | 5 | 2011 |
Recent Advances of IBM's Handheld Speech Translation System W Zhu, B Zhou, C Prosser, P Krbec, Y Gao Ninth International Conference on Spoken Language Processing, 2006 | 5 | 2006 |
Towards a common speech analysis engine H Aronowitz, I Gat, E Morais, W Zhu, R Hoory ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 3 | 2022 |
Adaptive selection of message data properties for improving communication throughput and reliability KW Church, M Franz, NS Kersting, JS McCarley, JW Pelecanos, W Zhu US Patent 10,305,765, 2019 | 3 | 2019 |
Handheld speech to speech translation system ZH Tan, B Lindberg, Y Gao, B Zhou, W Zhu, W Zhang Automatic Speech Recognition on Mobile Devices and over Communication …, 2008 | 3 | 2008 |