Evolutionary Study of Web Spam: Webb Spam Corpus 2011 versus Webb Spam Corpus 2006

Autor: Danesh Irani, Calton Pu, De Wang
Rok vydání: 2012
Předmět:
Zdroj: CollaborateCom
DOI: 10.4108/icst.collaboratecom.2012.250689
Popis: With over 2.5 hours a day spent browsing websites online [1] and with over a billion pages [2], identifying and detecting web spam is an important problem. Although large corpora of legitimate web pages are available to researchers, the same cannot be said about web spam or spam web pages.
Databáze: OpenAIRE