FastSum

Autor:	Ravi Kondadadi, Frank Schilder
Rok vydání:	2008
Předmět:	Information retrieval Parsing Computer science business.industry Part of speech computer.software_genre Automatic summarization Set (abstract data type) Support vector machine Ranking Multi-document summarization ComputingMethodologies_DOCUMENTANDTEXTPROCESSING Artificial intelligence business computer Natural language processing
Zdroj:	ACL (Short Papers)
Popis:	We present a fast query-based multi-document summarizer called FastSum based solely on word-frequency features of clusters, documents and topics. Summary sentences are ranked by a regression SVM. The summarizer does not use any expensive NLP techniques such as parsing, tagging of names or even part of speech information. Still, the achieved accuracy is comparable to the best systems presented in recent academic competitions (i.e., Document Understanding Conference (DUC)). Because of a detailed feature analysis using Least Angle Regression (LARS), FastSum can rely on a minimal set of features leading to fast processing times: 1250 news documents in 60 seconds.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::d2cb1569c03647fcc6a2fa189a822d1c https://doi.org/10.3115/1557690.1557748 Zobrazit plný text záznamu