Modeling Context Between Objects for Referring Expression Understanding VK Nagaraja, VI Morariu, LS Davis European Conference on Computer Vision (ECCV), 2016 | 496 | 2016 |
Compressed Time Delay Neural Network for Small-Footprint Keyword Spotting. M Sun, D Snyder, Y Gao, VK Nagaraja, M Rodehorst, S Panchapagesan, ... INTERSPEECH, 2017 | 139 | 2017 |
Model shrinking for embedded keyword spotting M Sun, B Hoffmeister, SNP Vitaladevuni, VK Nagaraja US Patent 9,600,231, 2017 | 42 | 2017 |
Model Shrinking for Embedded Keyword Spotting M Sun, VK Nagaraja, S Vitaladevuni International Conference on Machine Learning and Applications (ICMLA), 2015 | 26 | 2015 |
Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution Y Shi, C Wu, D Wang, A Xiao, J Mahadeokar, X Zhang, C Liu, K Li, ... International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022 | 17 | 2022 |
Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For Latency Y Shi, V Nagaraja, C Wu, J Mahadeokar, D Le, R Prabhavalkar, A Xiao, ... INTERSPEECH, 2021 | 16 | 2021 |
Collaborative Training of Acoustic Encoders for Speech Recognition V Nagaraja, Y Shi, G Venkatesh, O Kalinli, ML Seltzer, V Chandra INTERSPEECH, 2021 | 11 | 2021 |
Feature Selection using Partial Least Squares Regression and Optimal Experiment Design VK Nagaraja, W Abd-Almageed International Joint Conference on Neural Networks (IJCNN), 2015 | 11 | 2015 |
FoleyGen: Visually-guided audio generation X Mei, V Nagaraja, G Le Lan, Z Ni, E Chang, Y Shi, V Chandra International Workshop on Machine Learning for Signal Processing (MLSP), 2024 | 9 | 2024 |
Stack-and-Delay: A New Codebook Pattern for Music Generation GL Lan, V Nagaraja, E Chang, D Kant, Z Ni, Y Shi, F Iandola, V Chandra International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2024 | 8 | 2024 |
Searching for Objects using Structure in Indoor Scenes VK Nagaraja, VI Morariu, LS Davis British Machine Vision Conference (BMVC), 2015 | 8 | 2015 |
Speech processing using a recurrent neural network G Fu, T Senechal, SNP Vitaladevuni, MJ Rodehorst, VK Nagaraja US Patent 11,205,420, 2021 | 7 | 2021 |
High fidelity text-guided music generation and editing via single-stage flow matching G Le Lan, B Shi, Z Ni, S Srinivasan, A Kumar, B Ellis, D Kant, V Nagaraja, ... arXiv e-prints, arXiv:2407.03648, 2024 | 4 | 2024 |
On the Open Prompt Challenge in Conditional Audio Generation E Chang, S Srinivasan, M Luthra, PJ Lin, V Nagaraja, F Iandola, Z Liu, ... International Conference on Acoustics, Speech and Signal Processing (ICASSP …, 2024 | 4 | 2024 |
Wakeword detection using multi-word model Y Gao, M Sun, V Nagaraja, G Fu, C Wang, SNP Vitaladevuni US Patent 11,308,939, 2022 | 3 | 2022 |
Feedback Loop Between High Level Semantics and Low Level Vision VK Nagaraja, VI Morariu, LS Davis European Conference on Computer Vision Workshops (ECCVW), 2014 | 1 | 2014 |
SyncFlow: Toward Temporally Aligned Joint Audio-Video Generation from Text H Liu, GL Lan, X Mei, Z Ni, A Kumar, V Nagaraja, W Wang, MD Plumbley, ... arXiv preprint arXiv:2412.15220, 2024 | | 2024 |
Mixed reality system with acoustic event detection, adaptive anchors for object placement, and improved LED design M Keshavarzi, B Zhang, Y Wang, Z Yang, Y Shi, VK Nagaraja, G Le Lan, ... US Patent App. 18/642,606, 2024 | | 2024 |
Towards Temporally Synchronized Visually Indicated Sounds Through Scale-Adapted Positional Embeddings X Mei, G Le Lan, H Liu, Z Ni, VK Nagaraja, A Kumar, Y Shi, V Chandra Audio Imagination: NeurIPS 2024 Workshop AI-Driven Speech, Music, and Sound …, 2024 | | 2024 |
Enhance Audio Generation Controllability Through Representation Similarity Regularization Y Shi, GL Lan, V Nagaraja, Z Ni, X Mei, E Chang, F Iandola, Y Liu, ... arXiv preprint arXiv:2309.08773, 2023 | | 2023 |