Regularizing Output Distribution of Abstractive Chinese Social Media Text Summarization for Improved Semantic Consistency

Autor:	Xiaoyan Cai, Xuancheng Ren, Bingzhen Wei, Xu Sun, Qi Su, Yi Zhang
Jazyk:	angličtina
Rok vydání:	2018
Předmět:	FOS: Computer and information sciences Computer Science - Computation and Language General Computer Science business.industry Computer science 05 social sciences 050401 social sciences methods computer.software_genre Semantics Regularization (mathematics) Automatic summarization Task (project management) 030507 speech-language pathology & audiology 03 medical and health sciences Distribution (mathematics) 0504 sociology Social media Artificial intelligence 0305 other medical science business Spurious relationship Computation and Language (cs.CL) computer Word (computer architecture) Natural language processing
Popis:	ive text summarization is a highly difficult problem, and the sequence-to-sequence model has shown success in improving the performance on the task. However, the generated summaries are often inconsistent with the source content in semantics. In such cases, when generating summaries, the model selects semantically unrelated words with respect to the source content as the most probable output. The problem can be attributed to heuristically constructed training data, where summaries can be unrelated to the source content, thus containing semantically unrelated words and spurious word correspondence. In this article, we propose a regularization approach for the sequence-to-sequence model and make use of what the model has learned to regularize the learning objective to alleviate the effect of the problem. In addition, we propose a practical human evaluation method to address the problem that the existing automatic evaluation method does not evaluate the semantic consistency with the source content properly. Experimental results demonstrate the effectiveness of the proposed approach, which outperforms almost all the existing models. Especially, the proposed approach improves the semantic consistency by 4% in terms of human evaluation.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::1c011de13c3b498c55080e2d0539885b http://arxiv.org/abs/1805.04033 Zobrazit plný text záznamu