追蹤
Sung-Feng Huang
Sung-Feng Huang
在 ntu.edu.tw 的電子郵件地址已通過驗證
標題
引用次數
引用次數
年份
Meta-tts: Meta-learning for few-shot speaker adaptive text-to-speech
SF Huang, CJ Lin, DR Liu, YC Chen, H Lee
IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1558-1571, 2022
492022
Audio word2vec: Sequence-to-sequence autoencoding for unsupervised learning of audio segmentation and representation
YC Chen, SF Huang, H Lee, YH Wang, CH Shen
IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (9), 1481 …, 2019
412019
Phonetic-and-semantic embedding of spoken words with applications in spoken content retrieval
YC Chen, SF Huang, CH Shen, HY Lee, LS Lee
2018 IEEE Spoken Language Technology Workshop (SLT), 941-948, 2018
382018
Pretrained language model embryology: The birth of ALBERT
CH Chiang, SF Huang, H Lee
arXiv preprint arXiv:2010.02480, 2020
302020
Stabilizing label assignment for speech separation by self-supervised pre-training
SF Huang, SP Chuang, DR Liu, YC Chen, GP Yang, H Lee
arXiv preprint arXiv:2010.15366, 2020
23*2020
Towards unsupervised automatic speech recognition trained by unaligned speech and text only
YC Chen, CH Shen, SF Huang, H Lee
arXiv preprint arXiv:1803.10952, 2018
182018
Almost-unsupervised speech recognition with close-to-zero resource based on phonetic structures learned from very small unpaired speech and text data
YC Chen, CH Shen, SF Huang, H Lee, L Lee
arXiv preprint arXiv:1810.12566, 2018
132018
Speechnet: A universal modularized model for speech processing tasks
YC Chen, PH Chi, S Yang, KW Chang, J Lin, SF Huang, DR Liu, CL Liu, ...
arXiv preprint arXiv:2105.03070, 2021
112021
Non-autoregressive mandarin-english code-switching speech recognition
SP Chuang, HJ Chang, SF Huang, H Lee
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
102021
Learning phone recognition from unpaired audio and phone sequences based on generative adversarial network
D Liu, P Hsu, Y Chen, S Huang, S Chuang, D Wu, H Lee
IEEE/ACM transactions on audio, speech, and language processing 30, 230-243, 2021
72021
Improved audio embeddings by adjacency-based clustering with applications in spoken term detection
SF Huang, YC Chen, H Lee, L Lee
arXiv preprint arXiv:1811.02775, 2018
72018
From semi-supervised to almost-unsupervised speech recognition with very-low resource by jointly learning phonetic structures from audio and text embeddings
YC Chen, SF Huang, H Lee, L Lee
arXiv preprint arXiv:1904.05078, 2019
22019
Few-shot cross-lingual tts using transferable phoneme embedding
WP Huang, PC Chen, SF Huang, H Lee
arXiv preprint arXiv:2206.15427, 2022
12022
Maximizing Data Efficiency for Cross-Lingual TTS Adaptation by Self-Supervised Representation Mixing and Embedding Initialization
WP Huang, SF Huang, H Lee
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
2023
Personalized Lightweight Text-to-Speech: Voice Cloning with Adaptive Structured Pruning
SF Huang, CP Chen, ZS Chen, YP Tsai, H Lee
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
2023
系統目前無法執行作業,請稍後再試。
文章 1–15