Live monitoring 4chan discussion threads

Authors: Pozzana, Iacopo; Prifti, Ylli; Provetti, Alessandro
Language: English
Year of publication: 2021
Subject:
Source: IC2S2 2021: 7th International Conference on Computational Social Science
Description: The 4chan portal has been known for several years as a "fringe" internet service for sharing and commenting on pictures. Thanks to the possibility of posting anonymously, guaranteed by the total lack of a registration or identification mechanism, the portal has evolved into a global, if mostly US-centred, locus for the posting of extreme views, including racism and all sorts of hate speech. A pivotal role in the emergence of the website as a bastion of "free speech" has been played by the /pol/ board (https://boards.4chan.org/pol/), which declares its commitment to hosting "politically incorrect" discussions. Several research groups have intensively studied the structure, dynamics and contents of 4chan. Thanks to works such as [4, 12], we now have a fairly clear description of how 4chan works and what type of discussion dynamics the site supports. In particular, the latter work shed light on the extremely ephemeral nature of discussions, with threads lasting on the website for a few hours at most, and often just for minutes, depending on the traffic they generate, before being removed to make room for new discussions. Given the fast-paced evolution of the content of the boards, and especially given how such ephemerality shapes the tone and the content of the discussion itself [4, 14], it is extremely important for researchers to be able to capture the content of the threads at various points over the course of their short lives. To the best of our knowledge, the existing 4chan literature has relied either on first-hand (autoptic) exploration by the scholars [14], or on large-scale data collection campaigns that drew their content from the archived versions of the threads [12], i.e. on copies of the threads as they appeared at the time of their closure, and at that time only. In order to observe the content on the website at a more fine-grained level, we devised a "scraping" architecture, summarised in Figure 2, which is based on the OXPath platform [9]. It enables the retrieval of the threads posted on a board at various points while they are still live.
Database: OpenAIRE
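
Illustrative note: the architecture described above is built on OXPath, and this record gives no further detail about it. The following is therefore not the authors' implementation but a minimal Python sketch of the general idea of capturing threads repeatedly while they are still live. It assumes 4chan's public read-only JSON endpoints (`threads.json` per board and one JSON document per thread under `a.4cdn.org`); the names `live_threads` and `poll_once`, and the one-second pause, are hypothetical choices made for this example.

```python
import json
import time
import urllib.error
import urllib.request

# Assumed layout of 4chan's read-only JSON API; not taken from the paper.
API_ROOT = "https://a.4cdn.org"


def fetch_json(url):
    """Download and decode one JSON document (requires network access)."""
    with urllib.request.urlopen(url, timeout=30) as response:
        return json.load(response)


def live_threads(board, fetch=fetch_json):
    """Map each currently live thread number to its last-modified value."""
    pages = fetch(f"{API_ROOT}/{board}/threads.json")
    return {t["no"]: t["last_modified"] for page in pages for t in page["threads"]}


def poll_once(board, seen, fetch=fetch_json, pause=1.0):
    """Snapshot every thread that is new or has changed since the last poll.

    `seen` maps thread number -> last-modified value already captured and is
    updated in place. Returns a list of (thread_no, posts) snapshots.
    """
    snapshots = []
    for no, modified in live_threads(board, fetch).items():
        if seen.get(no) == modified:
            continue  # no new activity since the previous snapshot
        try:
            thread = fetch(f"{API_ROOT}/{board}/thread/{no}.json")
        except urllib.error.HTTPError:
            continue  # thread was removed between listing and download
        snapshots.append((no, thread["posts"]))
        seen[no] = modified
        time.sleep(pause)  # hypothetical politeness delay between requests
    return snapshots
```

Calling `poll_once("pol", seen)` in a loop, with `seen` kept between calls, would yield successive snapshots of each thread during its lifetime rather than a single copy taken at closure. The `fetch` parameter is injectable so that the polling logic can be exercised without network access.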