Showing 1 - 10 of 23 for the search '"Myle Ott"'
Published in:
Proceedings of the International AAAI Conference on Web and Social Media. 7:390-399
User-generated comments in online social media have recently been gaining increasing attention as a viable source of general-purpose descriptive annotations for digital objects like photos or videos. Because users have different levels of expertise, …
Author:
Jing Xu, Stephen Roller, Eric Michael Smith, Jason Weston, Yinhan Liu, Emily Dinan, Y-Lan Boureau, Mary Williamson, Naman Goyal, Myle Ott, Da Ju
Published in:
EACL
Building open-domain chatbots is a challenging area for machine learning research. While prior work has shown that scaling neural models in the number of parameters and the size of the data they are trained on gives improved results, we highlight oth…
Published in:
Proceedings of the 6th Workshop on Representation Learning for NLP (RepL4NLP-2021).
Recent work has demonstrated the effectiveness of cross-lingual language model pretraining for cross-lingual understanding. In this study, we present the results of two larger multilingual masked language models, with 3.5B and 10.7B parameters. Our t…
Published in:
EACL
In this work, we study how the finetuning stage in the pretrain-finetune framework changes the behavior of a pretrained neural language generator. We focus on the transformer encoder-decoder model for the open-domain dialogue response generation task…
Published in:
EMNLP (Findings)
The state of the art on many NLP tasks is currently achieved by large pre-trained language models, which require a considerable amount of computation. We explore a setting where many different predictions are made on a single piece of text. In that c…
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::b7e9855dc87dcb31c324e0d75b180c8a
Published in:
ClinicalNLP@EMNLP
A large array of pretrained models is available to the biomedical NLP (BioNLP) community. Finding the best model for a particular task can be difficult and time-consuming. For many applications in the biomedical and clinical domains, it is crucial t…
Author:
Edouard Grave, Vishrav Chaudhary, Guillaume Wenzek, Luke Zettlemoyer, Kartikay Khandelwal, Veselin Stoyanov, Naman Goyal, Myle Ott, Alexis Conneau, Francisco Guzmán
Published in:
ACL
This paper shows that pretraining multilingual language models at scale leads to significant performance gains for a wide range of cross-lingual transfer tasks. We train a Transformer-based masked language model on one hundred languages, using more t…
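As a rough illustration of the masked-language-model objective this abstract refers to (not code from the paper), the Python sketch below masks a fraction of input tokens and asks the model to recover them. The [MASK] id, vocabulary size, masking rate, and the 80/10/10 replacement split are generic assumptions, not values taken from the publication.

import random

MASK_ID = 0          # hypothetical [MASK] token id (assumption)
VOCAB_SIZE = 50000   # hypothetical vocabulary size (assumption)

def mask_tokens(token_ids, mask_prob=0.15):
    """Return (masked_input, labels) for masked-language-model training.

    Positions that are not selected get label -100 so a standard
    cross-entropy loss can ignore them.
    """
    masked_input, labels = [], []
    for tok in token_ids:
        if random.random() < mask_prob:
            labels.append(tok)  # the model must recover the original token
            r = random.random()
            if r < 0.8:
                masked_input.append(MASK_ID)                       # replace with [MASK]
            elif r < 0.9:
                masked_input.append(random.randrange(VOCAB_SIZE))  # replace with a random token
            else:
                masked_input.append(tok)                           # keep the original token
        else:
            labels.append(-100)
            masked_input.append(tok)
    return masked_input, labels

In multilingual pretraining of the kind described above, batches drawn from different languages share this same objective and a single shared vocabulary.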
Author:
Fabio Petroni, Fabrizio Silvestri, Sebastian Riedel, Myle Ott, Vassilis Plachouras, Luca Massarelli, Tim Rocktäschel, Aleksandra Piktus
Published in:
EMNLP (Findings)
Recent progress in pre-trained language models led to systems that are able to generate text of an increasingly high quality. While several works have investigated the fluency and grammatical correctness of such models, it is still unclear to which e…
Author:
Jiatao Gu, Jiajun Shen, Junxian He, Michael Auli, Peng-Jen Chen, Matthew Le, Marc'Aurelio Ranzato, Myle Ott
Published in:
EACL
While we live in an increasingly interconnected world, different places still exhibit strikingly different cultures and many events we experience in our everyday life pertain only to the specific place we live in. As a result, people often talk abou…
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::1ce4165096d248e3f51629421c920fcc
http://arxiv.org/abs/1909.13151
Published in:
ACL
Back-translation is a widely used data augmentation technique which leverages target monolingual data. However, its effectiveness has been challenged since automatic metrics such as BLEU only show significant improvements for test examples where the… (see the back-translation sketch after this entry)
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::5bb95df2b53bdcc07bf8c55b97d2a183
http://arxiv.org/abs/1908.05204
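As a generic illustration of the back-translation technique described in the last abstract (not the authors' code), the Python sketch below pairs target-side monolingual sentences with synthetic sources produced by a reverse translation model. The translate_tgt_to_src callable is a hypothetical placeholder for any target-to-source translation model.

def back_translate(monolingual_tgt_sentences, translate_tgt_to_src):
    """Build synthetic parallel pairs from target-side monolingual data."""
    synthetic_pairs = []
    for tgt in monolingual_tgt_sentences:
        # Generate a synthetic source sentence with the reverse model
        # (beam search or sampling; sampling tends to add useful diversity).
        synthetic_src = translate_tgt_to_src(tgt)
        synthetic_pairs.append((synthetic_src, tgt))
    return synthetic_pairs

# The synthetic pairs are mixed with the genuine parallel corpus and the
# forward (source-to-target) model is retrained on the combined data.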