ProfNER guidelines - English version

Autor: Farré-Maduell, Eulàlia, Lima López, Salvador, Briva-Iglesias, Vicent, Aguero, Marvin, Miranda-Escalada, Antonio, Krallinger, Martin
Jazyk: Spanish; Castilian
Rok vydání: 2021
Předmět:
DOI: 10.5281/zenodo.4479740
Popis: SMM4H 2021 accepted at NAACL (scheduled in Mexico City in June) https://2021.naacl.org/. The ProfNER Shared Task encourages its participants to detect occupations and employment situations in Spanish tweets related to the COVID-19 situation. These guidelines describe the process followed by the clinical and linguist experts who manually annotated the ProfNER corpus. English version of ProfNER annotation guidelines. Please, cite: Miranda-Escalada, A., Farré-Maduell, E., Lima-López, S., Gascó, L., Briva-Iglesias, V., Agüero-Torales, M., & Krallinger, M. (2021, June). The profner shared task on automatic recognition of occupation mentions in social media: systems, evaluation, guidelines, embeddings and corpora. In Proceedings of the Sixth Social Media Mining for Health (# SMM4H) Workshop and Shared Task (pp. 13-20). @inproceedings{miranda2021profner, title={The profner shared task on automatic recognition of occupation mentions in social media: systems, evaluation, guidelines, embeddings and corpora}, author={Miranda-Escalada, Antonio and Farr{\'e}-Maduell, Eul{\`a}lia and Lima-L{\'o}pez, Salvador and Gasc{\'o}, Luis and Briva-Iglesias, Vicent and Ag{\"u}ero-Torales, Marvin and Krallinger, Martin}, booktitle={Proceedings of the Sixth Social Media Mining for Health (\# SMM4H) Workshop and Shared Task}, pages={13--20}, year={2021} } Annotation quality: We have performed a consistency analysis of the corpus. 10% of the documents have been annotated by an internal annotator as well as by the linguist experts following these annotation guidelines. The preliminary Inter-Annotator Agreement (pairwise agreement) is 0.919. For further information, please visit https://temu.bsc.es/smm4h-spanish/ or email us at encargo-pln-life@bsc.es Resources: Web Gold Standard corpus Annnotation Guidelines (in Spanish) FastText COVID-19 Twitter embeddings Occupations gazetteer Conference Proceedings
Funded by the Plan de Impulso de las Tecnologías del Lenguaje (Plan TL).
Databáze: OpenAIRE