Mining Social Science Publications for Survey Variables

Autor: Zielinski, Andrea, Mutschke, Peter
Přispěvatelé: Association for Computational Linguistics (ACL)
Rok vydání: 2017
Předmět:
Zdroj: Proceedings of the Second Workshop on NLP and Computational Social Science, 47-52
Druh dokumentu: Konferenzbeitrag<br />conference paper<br />conferenceObject
Popis: Research in Social Science is usually based on survey data where individual research questions relate to observable concepts (variables). However, due to a lack of standards for data citations a reliable identification of the variables used is often difficult. In this paper, we present a work-in-progress study that seeks to provide a solution to the variable detection task based on supervised machine learning algorithms, using a linguistic analysis pipeline to extract a rich feature set, including terminological concepts and similarity metric scores. Further, we present preliminary results on a small dataset that has been specifically designed for this task, yielding modest improvements over the baseline.
Databáze: SSOAR – Social Science Open Access Repository