Zobrazeno 1 - 4
of 4
pro vyhledávání: '"Dewan, Christopher"'
Autor:
Iyer, Srinivasan, Lin, Xi Victoria, Pasunuru, Ramakanth, Mihaylov, Todor, Simig, Daniel, Yu, Ping, Shuster, Kurt, Wang, Tianlu, Liu, Qing, Koura, Punit Singh, Li, Xian, O'Horo, Brian, Pereyra, Gabriel, Wang, Jeff, Dewan, Christopher, Celikyilmaz, Asli, Zettlemoyer, Luke, Stoyanov, Ves
Recent work has shown that fine-tuning large pre-trained language models on a collection of tasks described via instructions, a.k.a. instruction-tuning, improves their zero and few-shot generalization to unseen tasks. However, there is a limited unde
Externí odkaz:
http://arxiv.org/abs/2212.12017
Autor:
Zhang, Susan, Roller, Stephen, Goyal, Naman, Artetxe, Mikel, Chen, Moya, Chen, Shuohui, Dewan, Christopher, Diab, Mona, Li, Xian, Lin, Xi Victoria, Mihaylov, Todor, Ott, Myle, Shleifer, Sam, Shuster, Kurt, Simig, Daniel, Koura, Punit Singh, Sridhar, Anjali, Wang, Tianlu, Zettlemoyer, Luke
Large language models, which are often trained for hundreds of thousands of compute days, have shown remarkable capabilities for zero- and few-shot learning. Given their computational cost, these models are difficult to replicate without significant
Externí odkaz:
http://arxiv.org/abs/2205.01068
Autor:
Aly, Ahmed, Lakhotia, Kushal, Zhao, Shicong, Mohit, Mrinal, Oguz, Barlas, Arora, Abhinav, Gupta, Sonal, Dewan, Christopher, Nelson-Lindall, Stef, Shah, Rushin
We introduce PyText - a deep learning based NLP modeling framework built on PyTorch. PyText addresses the often-conflicting requirements of enabling rapid experimentation and of serving models at scale. It achieves this by providing simple and extens
Externí odkaz:
http://arxiv.org/abs/1812.08729
Autor:
DeWan, Christopher
Publikováno v:
Grey Sparrow Journal; Spring2013, Vol. 4 Issue 2, p1-3, 3p