A method for managing access to web pages: Filtering by Statistical Classification (FSC) applied to text

Autor: Eric Nyberg, George T. Duncan, Wenxuan Ding, Ramayya Krishnan, Jonathan P. Caulkins
Rok vydání: 2006
Předmět:
Zdroj: Decision Support Systems. 42:144-161
ISSN: 0167-9236
DOI: 10.1016/j.dss.2004.11.015
Popis: Various entities (e.g., parents, employers) that provide users (e.g., children, employees) access to web content wish to limit the content accessed through those computers. Available filtering methods are crude in that they too often block "acceptable" content while failing to block "unacceptable" content. This paper presents a general and flexible classification method based on statistical techniques applied to text material, that we call, Filtering by Statistical Classification (FSC). According to each individual entity's expressed opinions about what content in a training data set is or is not acceptable, FSC constructs a customized model to represent each individual entity's preferences. FSC then uses this customized model to examine new web content and to block unwanted content. The empirical results suggest that our method has greater predictive power than do a variety of existing approaches.
Databáze: OpenAIRE