Measuring Peculiarity of Text Using Relation between Words on the Web.

Author: Nakabayashi, Takeru; Yumoto, Takayuki; Nii, Manabu; Takahashi, Yutaka; Sumiya, Kazutoshi
Source: Role of Digital Libraries in a Time of Global Change; 2010, p112-115, 4p
Abstract: We define the peculiarity of text as a metric of information credibility. Higher peculiarity means lower credibility. We extract the theme word and the characteristic words from text and check whether there is a subject-description relation between them. The peculiarity is defined using the ratio of the subject-description relation between a theme word and characteristic words. We evaluate the extent to which peculiarity can be used to judge credibility by classifying text from Wikipedia and Uncyclopedia in terms of the peculiarity. [ABSTRACT FROM AUTHOR]
Database: Complementary Index
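
Note: The abstract gives no exact formula, so the following is only a minimal sketch of one plausible reading of it: peculiarity taken as one minus the fraction of characteristic words that stand in a subject-description relation to the theme word. The function name `peculiarity`, the `has_relation` callback, and the toy word pairs are all hypothetical. The paper checks the relation using words on the Web, which is replaced here by a hard-coded lookup.

```python
from typing import Callable, Iterable


def peculiarity(
    theme: str,
    characteristic_words: Iterable[str],
    has_relation: Callable[[str, str], bool],
) -> float:
    """Return 1 - (related words / all characteristic words).

    Assumption: this ratio form is an illustration, not the
    authors' published definition.
    """
    words = list(characteristic_words)
    if not words:
        raise ValueError("at least one characteristic word is required")
    related = sum(1 for word in words if has_relation(theme, word))
    return 1.0 - related / len(words)


# Hypothetical stand-in for a Web-based subject-description check.
KNOWN_PAIRS = {("penguin", "bird"), ("penguin", "Antarctica")}


def toy_relation(theme: str, word: str) -> bool:
    return (theme, word) in KNOWN_PAIRS


score = peculiarity(
    "penguin", ["bird", "Antarctica", "laser", "opera"], toy_relation
)
print(score)
```

Under this reading, a text whose characteristic words rarely describe its theme word scores close to 1, which matches the abstract's statement that higher peculiarity means lower credibility.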