IMT Mines Ales at HASOC 2019: Automatic Hate Speech Detection
Autor: | Jean-Christophe Mensonides, Pierre-Antoine Jean, Andon Tchechmedjiev, Sébastien Harispe |
---|---|
Přispěvatelé: | Laboratoire de Génie Informatique et Ingénierie de Production (LGI2P), IMT - MINES ALES (IMT - MINES ALES), Institut Mines-Télécom [Paris] (IMT)-Institut Mines-Télécom [Paris] (IMT), IMT - Mines Alès, Administrateur |
Jazyk: | angličtina |
Rok vydání: | 2019 |
Předmět: | |
Zdroj: | FIRE 2019-11th Forum for Information Retrieval Evaluation FIRE 2019-11th Forum for Information Retrieval Evaluation, Dec 2019, Kolkata, India. p.279-284 HAL |
Popis: | International audience; This paper presents the contribution of the LGI2P (Labo-ratoire de Génie Informatique et d'Ingénierie de Production) team from IMT Mines Alès to the Hate Speech and Offensive Content Identification in Indo-European Languages (HASOC) 2019 shared task. This challenge aims at automatically identifying hate speech content in social media through three sub-tasks, each available in three different languages (En-glish, German and Hindi). We are interested in sub-tasks A and B, requiring to (A) classify tweets as offensive or as non offensive, and (B) to further classify offensive tweets from sub-task A as hate speech, offensive speech or profane. We trained a fastText model for each proposed language and obtained promising results on the Hindi dataset for both sub-tasks A and B. |
Databáze: | OpenAIRE |
Externí odkaz: |