追蹤
Siddharth Dalmia
Siddharth Dalmia
其他名字Sid Dalmia
Research Scientist, Google DeepMind
在 google.com 的電子郵件地址已通過驗證 - 首頁
標題
引用次數
引用次數
年份
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ...
arXiv preprint arXiv:2403.05530, 2024
8752024
FLEURS: Few-shot Learning Evaluation of Universal Representations of Speech
A Conneau, M Ma, S Khanuja, Y Zhang, V Axelrod, S Dalmia, J Riesa, ...
SLT 2022, 2022
2602022
Epitran: Precision G2P for Many Languages
DR Mortensen, S Dalmia, P Littell
LREC 2018, 2018
1682018
Branchformer: Parallel mlp-attention architectures to capture local and global context for speech recognition and understanding
Y Peng, S Dalmia, I Lane, S Watanabe
ICML 2022, 17627-17643, 2022
1662022
Universal phone recognition with a multilingual allophone system
X Li, S Dalmia, J Li, M Lee, P Littell, J Yao, A Anastasopoulos, ...
ICASSP 2020, 2020
1452020
Sequence-based Multi-lingual Low Resource Speech Recognition
S Dalmia, R Sanabria, F Metze, AW Black
ICASSP 2018, 2018
1172018
Espnet-slu: Advancing spoken language understanding through espnet
S Arora, S Dalmia, P Denisov, X Chang, Y Ueda, Y Peng, Y Zhang, ...
ICASSP 2022, 7167-7171, 2022
792022
Robust ASR using neural network based speech enhancement and feature simulation
S Sivasankaran, AA Nugraha, E Vincent, JA Morales-Cordovilla, S Dalmia, ...
ASRU 2015, 2015
542015
Transformer-Transducers for Code-Switched Speech Recognition
S Dalmia, Y Liu, S Ronanki, K Kirchhoff
ICASSP 2021, 2021
532021
Towards Zero-shot Learning for Automatic Phonemic Transcription
X Li, S Dalmia, DR Mortensen, J Li, AW Black, F Metze
AAAI 2020, 2020
40*2020
Llm augmented llms: Expanding capabilities through composition
R Bansal, B Samanta, S Dalmia, N Gupta, S Vashishth, S Ganapathy, ...
arXiv preprint arXiv:2401.02412, 2024
352024
CTC alignments improve autoregressive translation
B Yan, S Dalmia, Y Higuchi, G Neubig, F Metze, AW Black, S Watanabe
EACL 2023, 2022
342022
On Long-Tailed Phenomena in Neural Machine Translation
V Raunak, S Dalmia, V Gupta, F Metze
EMNLP 2020 Findings, 2020
342020
Searchable Hidden Intermediates for End-to-End Models of Decomposable Sequence Tasks
S Dalmia, B Yan, V Raunak, F Metze, S Watanabe
NAACL 2021, arXiv: 2105.00573, 2021
322021
NoiseQA: Challenge set evaluation for user-centric question answering
A Ravichander, S Dalmia, M Ryskina, F Metze, E Hovy, AW Black
EACL 2021, 2021
312021
Gated Embeddings in End-to-End Speech Recognition for Conversational-Context Fusion
S Kim, S Dalmia, F Metze
ACL 2019, 2019
302019
An approach for self-training audio event detectors using web data
B Elizalde, A Shah, S Dalmia, MH Lee, R Badlani, A Kumar, B Raj, I Lane
EUSIPCO 2017, 2017
29*2017
Multilingual Speech Recognition with Corpus Relatedness Sampling
X Li, S Dalmia, AW Black, F Metze
InterSpeech 2019, 2019
252019
A Study on the Integration of Pre-trained SSL, ASR, LM and SLU Models for Spoken Language Understanding
Y Peng, S Arora, Y Higuchi, Y Ueda, S Kumar, K Ganesan, S Dalmia, ...
SLT 2022, 2022
242022
Joint Modeling of Code-Switched and Monolingual ASR via Conditional Factorization
B Yan, C Zhang, M Yu, SX Zhang, S Dalmia, D Berrebbi, C Weng, ...
ICASSP 2022, 6412-6416, 2022
222022
系統目前無法執行作業,請稍後再試。
文章 1–20