NIT_Agartala_NLP_Team at SemEval-2019 task 6: An ensemble approach to identifying and categorizing offensive language in twitter social media corpora

Autor:	Björn Gambäck, Amitava Das, Anupam Jamatia, Steve Durairaj Swamy
Předmět:	Ensemble forecasting business.industry Computer science Deep learning Offensive 02 engineering and technology computer.software_genre SemEval Task (project management) 020204 information systems 0202 electrical engineering electronic engineering information engineering 020201 artificial intelligence & image processing Social media Artificial intelligence business computer Natural language processing
Zdroj:	Scopus-Elsevier SemEval@NAACL-HLT
Popis:	The paper describes the systems submitted to OffensEval (SemEval 2019, Task 6) on ‘Identifying and Categorizing Offensive Language in Social Media’ by the ‘NIT_Agartala_NLP_Team’. A Twitter annotated dataset of 13,240 English tweets was provided by the task organizers to train the individual models, with the best results obtained using an ensemble model composed of six different classifiers. The ensemble model produced macro-averaged F1-scores of 0.7434, 0.7078 and 0.4853 on Subtasks A, B, and C, respectively. The paper highlights the overall low predictive nature of various linguistic features and surface level count features, as well as the limitations of a traditional machine learning approach when compared to a Deep Learning counterpart. licensed on a Creative Commons Attribution 4.0 International License.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::6f090b634a5984edfc2a29d9d255b8b7 http://www.scopus.com/inward/record.url?eid=2-s2.0-85093449550&partnerID=MN8TOARS Zobrazit plný text záznamu