Team RobertNLP at the BioCreative VII LitCovid track: neural document classification using SciBERT

Autor:	Chandra Pujari, Subhash, Tarsi, Tim, Strötgen, Jannik, Friedrich, Annemarie
Jazyk:	angličtina
Rok vydání:	2023
Předmět:	ddc:004
Popis:	This paper describes our submission to the BioCreative VII LitCovid track Multi-label topic classification for COVID-19 literature annotation. Our system generates embeddings for title, abstract, and keywords using the transformer-based pre-trained language model SciBERT. The classification layer consists of several multi-layer perceptrons, each predicting the applicability of a single label. Our approach, originally developed for hierarchical patent classification, shows a strong performance on the LitCovid shared task, outperforming roughly 75% of the participating systems. Keywords—document representation; multi-task learning; multi-label classification.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=od______3341::d54b63ac4e803e90ea026d7e1607d878 https://opus.bibliothek.uni-augsburg.de/opus4/frontdoor/index/index/docId/105623 Zobrazit plný text záznamu