Showing 1 - 10 of 20 for search: '"Dinghan Shen"'
Author:
Yuan-Fang Wang, Xin Wang, Dinghan Shen, William Yang Wang, Qiuyuan Huang, Lei Zhang, Asli Celikyilmaz, Jianfeng Gao
Published in:
IEEE Transactions on Pattern Analysis and Machine Intelligence. 43:4205-4216
Vision-language navigation (VLN) is the task of navigating an embodied agent to carry out natural language instructions inside real 3D environments. In this paper, we study how to address three critical challenges for this task: the cross-modal grounding…
Published in:
ACL/IJCNLP (1)
Fine-tuning large pre-trained models with task-specific data has achieved great success in NLP. However, it has been demonstrated that the majority of information within the self-attention networks is redundant and not utilized effectively during the…
Author:
Qian Yang, Chunyuan Li, Yizhe Zhang, Lawrence Carin, Wenlin Wang, Jianqiao Li, Liqun Chen, Yuh-Chen Lin, Hao Fu, Chenyang Tao, Guoyin Wang, Dinghan Shen, Ruiyi Zhang
Published in:
EMNLP (1)
Neural language models are often trained with maximum likelihood estimation (MLE), where the next word is generated conditioned on the ground-truth word tokens. During testing, however, the model is instead conditioned on previously generated tokens…
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::4cb90e0003c412a95b63c2561e0d0d31
http://arxiv.org/abs/2010.05994
Author:
Wenlin Wang, Zheng Wen, Changyou Chen, Ruiyi Zhang, Zhe Gan, Dinghan Shen, Lawrence Carin, Guoyin Wang
Published in:
ACL
Auto-regressive text generation models usually focus on local fluency, and may cause inconsistent semantic meaning in long text generation. Further, automatically generating words with similar semantics is challenging, and hand-crafted linguistic rules…
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::a20c60bf3c9e0c6983229a4f66e0673d
http://arxiv.org/abs/2005.01279
Published in:
ACL
Generative semantic hashing is a promising technique for large-scale information retrieval thanks to its fast retrieval speed and small memory footprint. For the tractability of training, existing generative-hashing methods mostly assume a factorized…
Author:
Christopher Malon, Martin Renqiang Min, Pengyu Cheng, Yitong Li, Dinghan Shen, Yizhe Zhang, Lawrence Carin
Published in:
ACL
Learning disentangled representations of natural language is essential for many NLP tasks, e.g., conditional text generation, style transfer, personalized dialogue systems, etc. Similar problems have been studied extensively for other forms of data…
Published in:
EMNLP/IJCNLP (1)
Hashing is promising for large-scale information retrieval tasks thanks to the efficiency of distance evaluation between binary codes. Generative hashing is often used to generate hashing codes in an unsupervised way. However, existing generative hashing…
Author:
Zhe Gan, Lawrence Carin, Wenlin Wang, Ruiyi Zhang, Dinghan Shen, Hongteng Xu, Guoyin Wang, Changyou Chen
Published in:
NAACL-HLT (1)
We propose a topic-guided variational auto-encoder (TGVAE) model for text generation. Distinct from existing variational auto-encoder (VAE) based approaches, which assume a simple Gaussian prior for latent code, our model specifies the prior as a Gaussian…
Author:
Asli Celikyilmaz, Lawrence Carin, Dhanasekar Sundararaman, Xinyuan Zhang, Qian Yang, Dinghan Shen, Meng Tang, Pengyu Cheng
Published in:
Scopus-Elsevier
ACL (1)
Vector representations of sentences, trained on massive text corpora, are widely used as generic sentence embeddings across a variety of NLP problems. The learned representations are generally assumed to be continuous and real-valued, giving rise to…
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::25f6b14daeb1a366058b956a4754cf9d
Author:
Liqun Chen, Dinghan Shen, Asli Celikyilmaz, Lawrence Carin, Jianfeng Gao, Yizhe Zhang, Xin Wang
Published in:
Scopus-Elsevier
ACL (1)
Variational autoencoders (VAEs) have received much attention recently as an end-to-end architecture for text generation with latent variables. In this paper, we investigate several multi-level structures to learn a VAE model to generate long, and coherent…
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::dafc5fafdf210f0f5620b6af8432f4fb