Výsledky vyhledávání - "Jones, Llion"

Report

Autor: Pickett, Marc, Nain, Aakash Kumar, Modayil, Joseph, Jones, Llion

Modern machine learning systems have demonstrated substantial abilities with methods that either embrace or ignore human-provided knowledge, but combining benefits of both styles remains a challenge. One particular challenge involves designing learni

Externí odkaz: http://arxiv.org/abs/2408.04242

Zobrazit plný text záznamu

Report

Transformer Layers as Painters

Autor: Sun, Qi, Pickett, Marc, Nain, Aakash Kumar, Jones, Llion

Despite their nearly universal adoption for large language models, the internal workings of transformers are not well understood. We aim to better understand the impact of removing or reorganizing information throughout the layers of a pretrained tra

Externí odkaz: http://arxiv.org/abs/2407.09298

Zobrazit plný text záznamu

Akademický článek

Natural Questions: A Benchmark for Question Answering Research

Autor: Kwiatkowski, Tom, Palomaki, Jennimaria, Redfield, Olivia, Collins, Michael, Parikh, Ankur, Alberti, Chris, Epstein, Danielle, Polosukhin, Illia, Devlin, Jacob, Lee, Kenton, Toutanova, Kristina, Jones, Llion, Kelcey, Matthew, Chang, Ming-Wei, Dai, Andrew M., Uszkoreit, Jakob, Le, Quoc, Petrov, Slav

Publikováno v: Transactions of the Association for Computational Linguistics, Vol 7, Pp 453-466 (2019)

We present the Natural Questions corpus, a question answering data set. Questions consist of real anonymized, aggregated queries issued to the Google search engine. An annotator is presented with a question along with a Wikipedia page from the top 5

Externí odkaz: https://doaj.org/article/8650fdc04d7944c4893d0b995b6de6f7

Zobrazit plný text záznamu

Report

Helpful Neighbors: Leveraging Neighbors in Geographic Feature Pronunciation

Autor: Jones, Llion, Sproat, Richard, Ishikawa, Haruko, Gutkin, Alexander

If one sees the place name Houston Mercer Dog Run in New York, how does one know how to pronounce it? Assuming one knows that Houston in New York is pronounced "how-ston" and not like the Texas city, then one can probably guess that "how-ston" is als

Externí odkaz: http://arxiv.org/abs/2210.10200

Zobrazit plný text záznamu

Report

DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement

Autor: Koizumi, Yuma, Karita, Shigeki, Wisdom, Scott, Erdogan, Hakan, Hershey, John R., Jones, Llion, Bacchiani, Michiel

Single-channel speech enhancement (SE) is an important task in speech processing. A widely used framework combines an analysis/synthesis filterbank with a mask prediction network, such as the Conv-TasNet architecture. In such systems, the denoising p

Externí odkaz: http://arxiv.org/abs/2106.15813

Zobrazit plný text záznamu

Report

A Comparative Study on Neural Architectures and Training Methods for Japanese Speech Recognition

Autor: Karita, Shigeki, Kubo, Yotaro, Bacchiani, Michiel Adriaan Unico, Jones, Llion

End-to-end (E2E) modeling is advantageous for automatic speech recognition (ASR) especially for Japanese since word-based tokenization of Japanese is not trivial, and E2E modeling is able to model character sequences directly. This paper focuses on t

Externí odkaz: http://arxiv.org/abs/2106.05111

Zobrazit plný text záznamu

Report

CodeTrans: Towards Cracking the Language of Silicon's Code Through Self-Supervised Deep Learning and High Performance Computing

Autor: Elnaggar, Ahmed, Ding, Wei, Jones, Llion, Gibbs, Tom, Feher, Tamas, Angerer, Christoph, Severini, Silvia, Matthes, Florian, Rost, Burkhard

Currently, a growing number of mature natural language processing applications make people's life more convenient. Such applications are built by source code - the language in software engineering. However, the applications for understanding source c

Externí odkaz: http://arxiv.org/abs/2104.02443

Zobrazit plný text záznamu

Report

ProtTrans: Towards Cracking the Language of Life's Code Through Self-Supervised Deep Learning and High Performance Computing

Autor: Elnaggar, Ahmed, Heinzinger, Michael, Dallago, Christian, Rihawi, Ghalia, Wang, Yu, Jones, Llion, Gibbs, Tom, Feher, Tamas, Angerer, Christoph, Steinegger, Martin, Bhowmik, Debsindhu, Rost, Burkhard

Computational biology and bioinformatics provide vast data gold-mines from protein sequences, ideal for Language Models taken from NLP. These LMs reach for new prediction frontiers at low inference costs. Here, we trained two auto-regressive models (

Externí odkaz: http://arxiv.org/abs/2007.06225

Zobrazit plný text záznamu

Report

Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling

Autor: Shen, Jonathan, Nguyen, Patrick, Wu, Yonghui, Chen, Zhifeng, Chen, Mia X., Jia, Ye, Kannan, Anjuli, Sainath, Tara, Cao, Yuan, Chiu, Chung-Cheng, He, Yanzhang, Chorowski, Jan, Hinsu, Smit, Laurenzo, Stella, Qin, James, Firat, Orhan, Macherey, Wolfgang, Gupta, Suyog, Bapna, Ankur, Zhang, Shuyuan, Pang, Ruoming, Weiss, Ron J., Prabhavalkar, Rohit, Liang, Qiao, Jacob, Benoit, Liang, Bowen, Lee, HyoukJoong, Chelba, Ciprian, Jean, Sébastien, Li, Bo, Johnson, Melvin, Anil, Rohan, Tibrewal, Rajat, Liu, Xiaobing, Eriguchi, Akiko, Jaitly, Navdeep, Ari, Naveen, Cherry, Colin, Haghani, Parisa, Good, Otavio, Cheng, Youlong, Alvarez, Raziel, Caswell, Isaac, Hsu, Wei-Ning, Yang, Zongheng, Wang, Kuan-Chieh, Gonina, Ekaterina, Tomanek, Katrin, Vanik, Ben, Wu, Zelin, Jones, Llion, Schuster, Mike, Huang, Yanping, Chen, Dehao, Irie, Kazuki, Foster, George, Richardson, John, Macherey, Klaus, Bruguier, Antoine, Zen, Heiga, Raffel, Colin, Kumar, Shankar, Rao, Kanishka, Rybach, David, Murray, Matthew, Peddinti, Vijayaditya, Krikun, Maxim, Bacchiani, Michiel A. U., Jablin, Thomas B., Suderman, Rob, Williams, Ian, Lee, Benjamin, Bhatia, Deepti, Carlson, Justin, Yavuz, Semih, Zhang, Yu, McGraw, Ian, Galkin, Max, Ge, Qi, Pundak, Golan, Whipkey, Chad, Wang, Todd, Alon, Uri, Lepikhin, Dmitry, Tian, Ye, Sabour, Sara, Chan, William, Toshniwal, Shubham, Liao, Baohua, Nirschl, Michael, Rondon, Pat

Lingvo is a Tensorflow framework offering a complete solution for collaborative deep learning research, with a particular focus towards sequence-to-sequence models. Lingvo models are composed of modular building blocks that are flexible and easily ex

Externí odkaz: http://arxiv.org/abs/1902.08295

Zobrazit plný text záznamu

Report

Character-Level Language Modeling with Deeper Self-Attention

Autor: Al-Rfou, Rami, Choe, Dokook, Constant, Noah, Guo, Mandy, Jones, Llion

LSTMs and other RNN variants have shown strong performance on character-level language modeling. These models are typically trained using truncated backpropagation through time, and it is common to assume that their success stems from their ability t

Externí odkaz: http://arxiv.org/abs/1808.04444

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání