Improving Vietnamese Sentence Compression by Segmenting Meaning Chunks

Autor: Van-Giau Ung, Minh-Quoc Nghiem, Ngan Luu-Thuy Nguyen, An-Vinh Luong, Nhi-Thao Tran
Rok vydání: 2015
Předmět:
Zdroj: KSE
DOI: 10.1109/kse.2015.74
Popis: This paper proposes an approach for sentence compression that only requires the part-of-speech information. The method is based on an observation of the human compression: adjacent words which form a meaning chunk usually are removed or retained together. We incorporate meaning chunk as a feature for a CRF-based sequence labeling system. Experimental results on English and Vietnamese compression datasets show that the proposed approach achieved better performance than the state-of-the-art systems.
Databáze: OpenAIRE