Text Summarization by Sentence Extraction Using Unsupervised Learning.

Autor: García-Hernández, René Arnulfo, Montiel, Romyna, Ledeneva, Yulia, Rendón, Eréndira, Gelbukh, Alexander, Cruz, Rafael
Zdroj: Micai 2008: Advances in Artificial Intelligence; 2008, p133-143, 11p
Abstrakt: The main problem for generating an extractive automatic text summary is to detect the most relevant information in the source document. Although, some approaches claim being domain and language independent, they use high dependence knowledge like key-phrases or golden samples for machine-learning approaches. In this work, we propose a language- and domain-independent automatic text summarization approach by sentence extraction using an unsupervised learning algorithm. Our hypothesis is that an unsupervised algorithm can help for clustering similar ideas (sentences). Then, for composing the summary, the most representative sentence is selected from each cluster. Several experiments in the standard DUC-2002 collection show that the proposed method obtains more favorable results than other approaches. [ABSTRACT FROM AUTHOR]
Databáze: Complementary Index