Improving Vietnamese Sentence Compression by Segmenting Meaning Chunks
Autor: | Van-Giau Ung, Minh-Quoc Nghiem, Ngan Luu-Thuy Nguyen, An-Vinh Luong, Nhi-Thao Tran |
---|---|
Rok vydání: | 2015 |
Předmět: |
Sentence compression
Computer science business.industry Speech recognition Vietnamese computer.software_genre Sequence labeling language.human_language Market segmentation Compression (functional analysis) Feature (machine learning) language Meaning (existential) Artificial intelligence business computer Natural language processing |
Zdroj: | KSE |
DOI: | 10.1109/kse.2015.74 |
Popis: | This paper proposes an approach for sentence compression that only requires the part-of-speech information. The method is based on an observation of the human compression: adjacent words which form a meaning chunk usually are removed or retained together. We incorporate meaning chunk as a feature for a CRF-based sequence labeling system. Experimental results on English and Vietnamese compression datasets show that the proposed approach achieved better performance than the state-of-the-art systems. |
Databáze: | OpenAIRE |
Externí odkaz: |