Showing 1 - 10 of 14 for search: '"Caglar Gulcehre"'
Author:
Tom Schaul, David Silver, James Molloy, Junhyuk Oh, Katrina McKinney, Oriol Vinyals, David H. Choi, Junyoung Chung, Tobias Pohlen, Dani Yogatama, Tobias Pfaff, Demis Hassabis, Michael Mathieu, Dan Horgan, Ivo Danihelka, Igor Babuschkin, Dario Wünsch, Tom Le Paine, Yury Sulsky, Wojciech Marian Czarnecki, Rémi Leblond, Ziyu Wang, Andrew Dudzik, Trevor Cai, Chris Apps, Yuhuai Wu, David Budden, Valentin Dalibard, Timo Ewalds, Oliver Smith, John P. Agapiou, Aja Huang, Roman Ring, Petko Georgiev, Max Jaderberg, Koray Kavukcuoglu, Alexander Vezhnevets, Caglar Gulcehre, Manuel Kroiss, Laurent Sifre, Richard E. Powell, Timothy P. Lillicrap
Published in:
Nature. 575:350-354
Many real-world applications require artificial agents to compete and coordinate with other agents in complex environments. As a stepping stone to this goal, the domain of StarCraft has emerged as an important challenge for artificial intelligence research…
Published in:
Neural Computation. 30:857-884
We extend the neural Turing machine (NTM) model into a dynamic neural Turing machine (D-NTM) by introducing trainable address vectors. This addressing scheme maintains two separate vectors for each memory cell: a content vector and an address vector. This allows…
Published in:
Computer Speech & Language. 45:137-148
Recent advances in end-to-end neural machine translation models have achieved promising results on high-resource language pairs such as En -> Fr and En -> De. One of the major factors behind these successes is the availability of high-quality parallel…
Author:
Max Tegmark, John Peurifoy, Marin Soljacic, Li Jing, Yichen Shen, Caglar Gulcehre, Yoshua Bengio
Published in:
Neural computation. 31(4)
We present a novel recurrent neural network (RNN)-based model that combines the remembering ability of unitary evolution RNNs with the ability of gated RNNs to effectively forget redundant or irrelevant information in its memory. We achieve this by…
Author:
Tong Wang, Xingdi Yuan, Caglar Gulcehre, Alessandro Sordoni, Adam Trischler, Sandeep Subramanian, Philip Bachman, Saizheng Zhang
Published in:
Rep4NLP@ACL
We propose a recurrent neural model that generates natural-language questions from documents, conditioned on answers. We show how to train the model using a combination of supervised and reinforcement learning. After teacher forcing for standard maximum likelihood…
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::b21ecb94f3bd5b7a53792e793ee451c3
http://arxiv.org/abs/1705.02012
Published in:
Rep4NLP@ACL
We investigate the integration of a planning mechanism into an encoder-decoder architecture with attention. We develop a model that can plan ahead when it computes alignments between the source and target sequences, not only for a single time-step but…
Published in:
IJCNN
Stochastic gradient algorithms are central to large-scale optimization problems and have led to important successes in the recent advancement of deep learning algorithms. The convergence of SGD depends on the careful choice of learning rate and…
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::415f49aca507cf3100d8980af2f087f3
Published in:
ACL (1)
The problem of rare and unknown words is an important issue that can potentially influence the performance of many NLP systems, including both traditional count-based and deep learning models. We propose a novel way to deal with the rare and unknown words…
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::d511039502a020fa0ceeb566bd664ade
http://arxiv.org/abs/1603.08148
Published in:
CoNLL
In this work, we model abstractive text summarization using Attentional Encoder-Decoder Recurrent Neural Networks, and show that they achieve state-of-the-art performance on two different corpora. We propose several novel models that address critical…
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::7529a765525c9159f4fbe497c386fc9d
Author:
Aaron Courville, Yoshua Bengio, Alberto Garcia-Duran, Caglar Gulcehre, Sungjin Ahn, Iulian Vlad Serban, Sarath Chandar
Published in:
ACL (1)
Over the past decade, large-scale supervised learning corpora have enabled machine learning researchers to make substantial advances. However, to date, there are no large-scale question-answer corpora available. In this paper we present the 30M…
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::ee701b34c9cdc5569b45ce0e14e17efa