Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ... arXiv preprint arXiv:2312.11805, 2023 | 2414 | 2023 |
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ... arXiv preprint arXiv:2403.05530, 2024 | 909 | 2024 |
Description-driven task-oriented dialog modeling J Zhao, R Gupta, Y Cao, D Yu, M Wang, H Lee, A Rastogi, I Shafran, Y Wu arXiv preprint arXiv:2201.08904, 2022 | 58 | 2022 |
Slm: Bridge the thin gap between speech and text foundation models M Wang, W Han, I Shafran, Z Wu, CC Chiu, Y Cao, N Chen, Y Zhang, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023 | 38 | 2023 |
Learning to infer entities, properties and their relations from clinical conversations N Du, M Wang, L Tran, G Li, I Shafran arXiv preprint arXiv:1908.11536, 2019 | 30 | 2019 |
Systematic kinetic analysis on monolayer lamellar crystal thickening via chain-sliding diffusion of polymers M Wang, H Gao, L Zha, EQ Chen, W Hu Macromolecules 46 (1), 164-171, 2013 | 23 | 2013 |
The medical scribe: corpus development and model performance analyses I Shafran, N Du, L Tran, A Perry, L Keyes, M Knichel, A Domin, L Huang, ... arXiv preprint arXiv:2003.11531, 2020 | 16 | 2020 |
Unsupervised slot schema induction for task-oriented dialog D Yu, M Wang, Y Cao, I Shafran, LE Shafey, H Soltau arXiv preprint arXiv:2205.04515, 2022 | 14 | 2022 |
Understanding medical conversations: Rich transcription, confidence scores & information extraction H Soltau, M Wang, I Shafran, LE Shafey arXiv preprint arXiv:2104.02219, 2021 | 11 | 2021 |
Anytod: A programmable task-oriented dialog system J Zhao, Y Cao, R Gupta, H Lee, A Rastogi, M Wang, H Soltau, I Shafran, ... arXiv preprint arXiv:2212.09939, 2022 | 10 | 2022 |
Evolution of multivalent nanoparticle adhesion via specific molecular interactions M Wang, SR Ravindranath, MK Rahim, EL Botvinick, JB Haun Langmuir 32 (49), 13124-13136, 2016 | 10 | 2016 |
Retrieval Augmented End-to-End Spoken Dialog Models M Wang, I Shafran, H Soltau, W Han, Y Cao, D Yu, L El Shafey ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 9 | 2024 |
Speech aware dialog system technology challenge (dstc11) H Soltau, I Shafran, M Wang, A Rastogi, J Zhao, Y Jia, W Han, Y Cao, ... arXiv preprint arXiv:2212.08704, 2022 | 9 | 2022 |
Speech-to-text adapter and speech-to-entity retriever augmented llms for speech understanding M Wang, I Shafran, H Soltau, W Han, Y Cao, D Yu, LE Shafey arXiv preprint arXiv:2306.07944, 2023 | 7 | 2023 |
Mux-plms: Pre-training language models with data multiplexing V Murahari, A Deshpande, C Jimenez, I Shafran, M Wang, Y Cao, ... Proceedings of the 8th Workshop on Representation Learning for NLP (RepL4NLP …, 2023 | 6 | 2023 |
Word-level confidence estimation for RNN transducers M Wang, H Soltau, L El Shafey, I Shafran 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 6 | 2021 |
Knowledge-grounded dialog state tracking D Yu, M Wang, Y Cao, I Shafran, LE Shafey, H Soltau arXiv preprint arXiv:2210.06656, 2022 | 5 | 2022 |
RNN Transducers for Nested Named Entity Recognition with constraints on alignment for long sequences H Soltau, I Shafran, M Wang, LE Shafey arXiv preprint arXiv:2203.03543, 2022 | 5 | 2022 |
DSTC-11: Speech aware task-oriented dialog modeling track H Soltau, I Shafran, M Wang, A Rastogi, W Han, Y Cao Proceedings of The Eleventh Dialog System Technology Challenge, 226-234, 2023 | 2 | 2023 |
RNN Transducers for Named Entity Recognition with constraints on alignment for understanding medical conversations. H Soltau, I Shafran, M Wang, L El Shafey INTERSPEECH, 1901-1905, 2022 | 1 | 2022 |