Showing 1 - 10 of 16 for search: '"Vishrav Chaudhary"'
Author:
Katharina Kann, Abteen Ebrahimi, Manuel Mager, Arturo Oncevay, John E. Ortega, Annette Rios, Angela Fan, Ximena Gutierrez-Vasques, Luis Chiruzzo, Gustavo A. Giménez-Lugo, Ricardo Ramos, Ivan Vladimir Meza Ruiz, Elisabeth Mager, Vishrav Chaudhary, Graham Neubig, Alexis Palmer, Rolando Coto-Solano, Ngoc Thang Vu
Published in:
Frontiers in Artificial Intelligence, Vol 5 (2022)
Little attention has been paid to the development of human language technology for truly low-resource languages—i.e., languages with limited amounts of digitally available text data, such as Indigenous languages. However, it has been shown that pre
External link:
https://doaj.org/article/3d0de02183b848f3900a562e6f99d495
Author:
Abteen Ebrahimi, Manuel Mager, Arturo Oncevay, Vishrav Chaudhary, Luis Chiruzzo, Angela Fan, John Ortega, Ricardo Ramos, Annette Rios, Ivan Vladimir Meza Ruiz, Gustavo Giménez-Lugo, Elisabeth Mager, Graham Neubig, Alexis Palmer, Rolando Coto-Solano, Thang Vu, Katharina Kann
Pretrained multilingual models are able to perform cross-lingual transfer in a zero-shot setting, even for languages unseen during pretraining. However, prior work evaluating performance on unseen languages has largely been limited to low-level, synt
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::e09ad28a5e1b4d5cb2039832f3a27c03
http://arxiv.org/abs/2104.08726
Author:
Francisco Guzmán, Yi-Lin Tuan, Lucia Specia, Vishrav Chaudhary, Ahmed El-Kishky, Adithya Renduchintala
Published in:
EACL
Quality estimation aims to measure the quality of translated content without access to a reference translation. This is crucial for machine translation systems in real-world scenarios where high-quality translation is needed. While many approaches ex
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::9935ddf0966987919570fb68f81b2fed
http://arxiv.org/abs/2102.04020
Published in:
EACL
We present an approach based on multilingual sentence embeddings to automatically extract parallel sentences from the content of Wikipedia articles in 85 languages, including several dialects or low-resource languages. We do not limit the extract
Author:
Elisabeth Mager-Hois, Gustavo A. Giménez-Lugo, Luis Chiruzzo, John Ortega, Alexis Palmer, Manuel Mager, Ngoc Thang Vu, Annette Rios, Arturo Oncevay, Angela Fan, Ximena Gutierrez-Vasques, Ivan Meza, Rolando Coto-Solano, Vishrav Chaudhary, Abteen Ebrahimi, Ricardo Argenton Ramos, Katharina Kann, Graham Neubig
Published in:
Proceedings of the First Workshop on Natural Language Processing for Indigenous Languages of the Americas.
This paper presents the results of the 2021 Shared Task on Open Machine Translation for Indigenous Languages of the Americas. The shared task featured two independent tracks, and participants submitted machine translation systems for up to 10 indigen
Author:
Yuqing Tang, Jiatao Gu, Naman Goyal, Vishrav Chaudhary, Peng-Jen Chen, Angela Fan, Xian Li, Chau Tran
Published in:
ACL/IJCNLP (Findings)
Author:
Naman Goyal, Cynthia Gao, Vishrav Chaudhary, Peng-Jen Chen, Guillaume Wenzek, Da Ju, Sanjana Krishnan, Marc’Aurelio Ranzato, Francisco Guzmán, Angela Fan
One of the biggest challenges hindering progress in low-resource and multilingual machine translation is the lack of good evaluation benchmarks. Current evaluation benchmarks either lack good coverage of low-resource languages, consider only restrict
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::daff96849911589a8d8438c21e71b6b7
Author:
Francisco Guzmán, Mona Diab, Ahmed El-Kishky, Philipp Koehn, Pascale Fung, Vishrav Chaudhary, Adithya Renduchintala, Wei-Jen Ko, Naman Goyal
Published in:
ACL/IJCNLP (1)
The scarcity of parallel data is a major obstacle for training high-quality machine translation systems for low-resource languages. Fortunately, some low-resource languages are linguistically related or similar to high-resource languages; these relat
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::f454fecd833513a1121ba2c6cd2ab066
Author:
Frédéric Blain, Nikolaos Aletras, Francisco Guzmán, Lisa Yankovskaya, Mark Fishel, Lucia Specia, Marina Fomicheva, Shuo Sun, Vishrav Chaudhary
Quality Estimation (QE) is an important component in making Machine Translation (MT) useful in real-world applications, as it is aimed to inform the user on the quality of the MT output at test time. Existing approaches require large amounts of exper
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::e9788592bf255a1df4f919b5059a26c0
http://hdl.handle.net/10044/1/84005
Author:
Edouard Grave, Veselin Stoyanov, Vishrav Chaudhary, Beliz Gunel, Jingfei Du, Onur Celebi, Michael Auli, Alexis Conneau
Published in:
NAACL-HLT
Unsupervised pre-training has led to much recent progress in natural language understanding. In this paper, we study self-training as another way to leverage unlabeled data through semi-supervised learning. To obtain additional data for a specific ta
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::35d9966a169360a0ecb857f71449e7a5