'Digital Narratives of COVID-19': A Twitter Dataset for Text Analysis in Spanish

Autor: Susanna Allés-Torrent, Gimena del Rio Riande, Jerry Bonnell, Dieyun Song, Nidia Hernández
Jazyk: angličtina
Rok vydání: 2021
Předmět:
Zdroj: Journal of Open Humanities Data, Vol 7 (2021)
Druh dokumentu: article
ISSN: 2059-481X
DOI: 10.5334/johd.28
Popis: 'Digital Narratives of COVID-19' (DHCovid) offers a curated Twitter corpus of digital conversations about the Coronavirus pandemic. The dataset is collected through a script via Twitter’s Application Programming Interface (API) starting on April 24th, 2020, and stored on GitHub as an open access repository of tweet identifiers that can be consulted, downloaded, and reused by scholars interested in Natural Language Processing (NLP), topic modelling, and other quantitative methods. A stable version of the dataset has also been released through Zenodo. Twitter datasets are structured in three main collections: tweets in Spanish worldwide; geolocated tweets in six Spanish-speaking areas spanning North and Central America (Mexico, Colombia, Ecuador), South America (Argentina, Peru), and Europe (Spain); and geolocated tweets in English and Spanish from the greater Miami area in South Florida.
Databáze: Directory of Open Access Journals