Disinformation: analysis and identification
Autor: | Archita Pathak, Rohini K. Srihari, Nihit Natu |
---|---|
Rok vydání: | 2021 |
Předmět: |
General Computer Science
Process (engineering) business.industry Computer science Applied Mathematics General Decision Sciences Context (language use) Data science Task (project management) Computational Mathematics Identification (information) Harm S.I. : SBP-BRiMS2020 Modeling and Simulation Disinformation Web application business Set (psychology) |
Zdroj: | Computational and Mathematical Organization Theory |
ISSN: | 1572-9346 1381-298X |
Popis: | We present an extensive study on disinformation, which is defined as information that is false and misleading and intentionally shared to cause harm. Through this work, we aim to answer the following questions:Can we automatically and accurately classify a news article as containing disinformation?What characteristics of disinformation differentiate it from other types of benign information? We conduct this study in the context of two significant events: the US elections of 2016 and the 2020 COVID pandemic. We build a series of classifiers to (i) examine linguistic clues exhibited by different types of fake news articles, (ii) analyze “clickbaityness” of disinformation headlines, and (iii) finally, perform fine-grained, veracity-based article classification through a natural language inference (NLI) module for automated disinformation verification; this utilizes a manually curated set of evidence sources. For the latter, we built a new dataset that is annotated with generic, veracity-based labels and ground truth evidence supporting each label. The veracity labels were formulated based on examining standards used by reputable fact-checking organizations. We show that disinformation derives features from both propaganda and mainstream news, making it more challenging to detect. However, there is significant potential for automating the fact-checking process to incorporate the degree of veracity. We provide error analysis that illustrates the challenges involved in the automated fact-checking task and identifies factors that may improve this process in future work. Finally, we also describe the implementation of a web app that extracts important entities and actions from a given article and searches the web to gather evidence from credible sources. The evidence articles are then used to generate a veracity label that can assist manual fact-checkers engaged in combating disinformation. |
Databáze: | OpenAIRE |
Externí odkaz: |