Mustango: Toward controllable text-to-music generation J Melechovsky, Z Guo, D Ghosal, N Majumder, D Herremans, S Poria arXiv preprint arXiv:2311.08355, 2023 | 58 | 2023 |
Comparison of automated acoustic methods for oral diadochokinesis assessment in amyotrophic lateral sclerosis M Novotny, J Melechovsky, K Rozenstoks, T Tykalova, P Kryze, M Kanok, ... Journal of Speech, Language, and Hearing Research 63 (10), 3453-3460, 2020 | 21 | 2020 |
Alzheimer’s dementia speech (audio vs. text): Multi-modal machine learning at high vs. low resolution P Priyadarshinee, CJ Clarke, J Melechovsky, CMY Lin, B BT, JM Chen Applied Sciences 13 (7), 4244, 2023 | 12 | 2023 |
Accented text-to-speech synthesis with a conditional variational autoencoder J Melechovsky, A Mehrish, B Sisman, D Herremans TENCON 2024-2024 IEEE Region 10 Conference (TENCON), 343-346, 2024 | 6 | 2024 |
Learning accent representation with multi-level vae towards controllable speech synthesis J Melechovsky, A Mehrish, D Herremans, B Sisman 2022 IEEE Spoken Language Technology Workshop (SLT), 928-935, 2023 | 6 | 2023 |
MidiCaps: A large-scale MIDI dataset with text captions J Melechovsky, A Roy, D Herremans arXiv preprint arXiv:2406.02255, 2024 | 4 | 2024 |
Drum Kit sound localization tests on binaural hearing model with ANN J Melechovský, J Bouše, F Rund, E Koshkina 2018 28th International Conference Radioelektronika (RADIOELEKTRONIKA), 1-5, 2018 | 2 | 2018 |
Accent Conversion in Text-To-Speech Using Multi-Level VAE and Adversarial Training J Melechovsky, A Mehrish, B Sisman, D Herremans arXiv preprint arXiv:2406.01018, 2024 | 1 | 2024 |
DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech J Melechovsky, A Mehrish, B Sisman, D Herremans arXiv preprint arXiv:2410.13342, 2024 | | 2024 |
ADDRESSING MULTI-MODAL MULTI-MODEL MULTI-FEATURE CUES IN ALZHEIMER’S DEMENTIA CJ Clarke, J Melechovsky, CMY Lin, P Priyadarshinee, BT Balamurali, ... | | 2022 |