Ahmet Ustun
Cohere For AI
Verified email at cohere.com
Title | Cited by | Year
UDapter: Language Adaptation for Truly Universal Dependency Parsing
A Üstün, A Bisazza, G Bouma, G van Noord
Proceedings of the 2020 Conference on Empirical Methods in Natural Language …, 2020
Cited by: 115 | Year: 2020
Massive choice, ample tasks (MaChAmp): A toolkit for multi-task learning in NLP
R van der Goot, A Üstün, A Ramponi, I Sharaf, B Plank
arXiv preprint arXiv:2005.14672, 2020
Cited by: 94 | Year: 2020
Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model
A Üstün, V Aryabumi, ZX Yong, WY Ko, D D'souza, G Onilude, N Bhandari, ...
arXiv preprint arXiv:2402.07827, 2024
Cited by: 80 | Year: 2024
Pushing Mixture of Experts to the Limit: Extremely Parameter Efficient MoE for Instruction Tuning
T Zadouri, A Üstün, A Ahmadian, B Ermiş, A Locatelli, S Hooker
arXiv preprint arXiv:2309.05444, 2023
Cited by: 66 | Year: 2023
When Less is More: Investigating Data Pruning for Pretraining LLMs at Scale
M Marion, A Üstün, L Pozzobon, A Wang, M Fadaee, S Hooker
arXiv preprint arXiv:2309.04564, 2023
Cited by: 59 | Year: 2023
Aya dataset: An open-access collection for multilingual instruction tuning
S Singh, F Vargus, D Dsouza, BF Karlsson, A Mahendiran, WY Ko, ...
arXiv preprint arXiv:2402.06619, 2024
Cited by: 52 | Year: 2024
Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs
A Ahmadian, C Cremer, M Gallé, M Fadaee, J Kreutzer, A Üstün, ...
arXiv preprint arXiv:2402.14740, 2024
Cited by: 49 | Year: 2024
Multilingual unsupervised neural machine translation with denoising adapters
A Üstün, A Berard, L Besacier, M Gallé
arXiv preprint arXiv:2110.10472, 2021
Cited by: 45 | Year: 2021
Characters or morphemes: How to represent words?
A Üstün, M Kurfalı, B Can
Association for Computational Linguistics, 2018
Cited by: 45 | Year: 2018
From masked language modeling to translation: Non-English auxiliary tasks improve zero-shot spoken language …
R van der Goot, I Sharaf, A Imankulova, A Üstün, M Stepanović, ...
Proceedings of the 2021 Conference of the North American Chapter of the …, 2021
Cited by: 36 | Year: 2021
Aya 23: Open weight releases to further multilingual progress
V Aryabumi, J Dang, D Talupuru, S Dash, D Cairuz, H Lin, B Venkitesh, ...
arXiv preprint arXiv:2405.15032, 2024
Cited by: 34 | Year: 2024
Automatic judgement forecasting for pending applications of the European Court of Human Rights
M Medvedeva, A Üstün, X Xu, M Vols, M Wieling
Proceedings of the Fifth Workshop on Automated Semantic Analysis of …, 2021
Cited by: 32 | Year: 2021
Hyper-X: A Unified Hypernetwork for Multi-Task Multilingual Transfer
A Üstün, A Bisazza, G Bouma, G van Noord, S Ruder
arXiv preprint arXiv:2205.12148, 2022
Cited by: 29 | Year: 2022
Intriguing properties of quantization at scale
A Ahmadian, S Dash, H Chen, B Venkitesh, ZS Gou, P Blunsom, A Üstün, ...
Advances in Neural Information Processing Systems 36, 34278-34294, 2023
Cited by: 26 | Year: 2023
Unsupervised morphological segmentation using neural word embeddings
A Üstün, B Can
Statistical Language and Speech Processing: 4th International Conference …, 2016
Cited by: 20 | Year: 2016
From Masked Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken Language Understanding
R van der Goot, I Sharaf, A Imankulova, A Üstün, M Stepanović, ...
arXiv preprint arXiv:2105.07316, 2021
Cited by: 14 | Year: 2021
UDapter: Typology-based Language Adapters for Multilingual Dependency Parsing and Sequence Labeling
A Üstün, A Bisazza, G Bouma, G van Noord
Computational Linguistics 48 (3), 555-592, 2022
Cited by: 12 | Year: 2022
Turkish pos tagging by reducing sparsity with morpheme tags in small datasets
B Can, A Üstün, M Kurfalı
Computational Linguistics and Intelligent Text Processing: 17th …, 2018
Cited by: 11 | Year: 2018
RLHF Can Speak Many Languages: Unlocking Multilingual Preference Optimization for LLMs
J Dang, A Ahmadian, K Marchisio, J Kreutzer, A Üstün, S Hooker
arXiv preprint arXiv:2407.02552, 2024
Cited by: 10 | Year: 2024
When does Parameter-Efficient Transfer Learning Work for Machine Translation?
A Üstün, AC Stickland
arXiv preprint arXiv:2205.11277, 2022
Cited by: 8 | Year: 2022