Gender Distribution across Topics in the Top 5 Economics Journals: A Machine Learning Approach

Autor: Conde Ruiz, José Ignacio, Ganuza, Juan-José, García, Manu, Puch, Luis A.
Rok vydání: 2021
Předmět:
Zdroj: E-Prints Complutense: Archivo Institucional de la UCM
Universidad Complutense de Madrid
E-Prints Complutense. Archivo Institucional de la UCM
instname
Popis: We analyze all the articles published in the top five (T5) Economics journals be- tween 2002 and 2019 in order to find gender differences in their research approach. We implement an unsupervised machine learning algorithm: the Structural Topic Model (STM), so as to incorporate gender document-level meta-data into a probabilistic text model. This algorithm characterizes jointly the set of latent topics that best fits our data (the set of abstracts) and how the documents/abstracts are allocated to each latent topic. Latent topics are mixtures over words where each word has a probability of belonging to a topic after controlling by journal name and publication year (the meta-data). Thus, the topics may capture research fields but also other more subtle characteristics related to the way in which the articles are written. We find that fe- males are unevenly distributed along the estimated latent topics, by using only data driven methods. This finding relies on “automatically” generated built-in data given the contents in the abstracts of the articles in the T5 journals, without any arbitrary allocation of texts to particular categories (as JEL codes, or research areas).
Databáze: OpenAIRE