Safeguarding Online Communications using DistilRoBERTa for Detection of Terrorism and Offensive Chats

Author:	Mohamed Safwan Saalik Shah, Amr Mohamed Abuaieta, Shaima Saeed Almazrouei
Language:	English
Publication year:	2024
Subject:	social media; offensive language; large language models; distilroberta model; Criminal law and procedure; K5000-5582; Cybernetics; Q300-390
Source:	Journal of Information Security and Cybercrimes Research, Vol 7, Iss 1, Pp 93-107 (2024)
Document type:	article
ISSN:	1658-7782 1658-7790
DOI:	10.26735/VNVR2791
Description:	People use social media for both good and distasteful purposes. Malicious use raises significant concerns because it involves offensive language and hate speech that promote terrorism and other negative behaviors. To create a safe, secure, and pleasant environment, these communications must be closely monitored to prevent severe problems and their associated risks. With the help of AI, specifically Large Language Models (LLMs), text and speech can be analyzed quickly to determine whether communications promote the dangers identified above or contain other toxic elements. The LLM used in this research is the DistilRoBERTa model from the Hugging Face Transformers library. The model was trained on datasets consisting of terrorism-related conversations, offensive conversations, and neutral conversations, all obtained from publicly available sources. The experimental results show that the model achieved 99% on accuracy, precision, recall, F1 score, and the ROC curve. Because real conversations are inaccessible due to restrictions, the model must be continuously fine-tuned to remain robust as communication behavior changes. A drag-and-drop interface is used to upload files and obtain the categorical output, ensuring seamless and easy interaction.
Database:	Directory of Open Access Journals
External link:	https://doaj.org/article/81998ecdcbb94bc892edb879c4d91392
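
The description reports accuracy, precision, recall, and F1 score for a three-way classification (terrorism-related, offensive, neutral). As a rough illustration of how such figures are commonly computed, the sketch below derives them from parallel lists of true and predicted labels. It is not the authors' evaluation code: the label strings, the toy data, the function names, and the choice of macro averaging are all assumptions made for this example, since the record does not state how the scores were averaged.

```python
from collections import Counter

# Assumed label names, based on the three dataset types named in the description.
LABELS = ["terrorism", "offensive", "neutral"]


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def per_class_scores(y_true, y_pred, labels=LABELS):
    """Return {label: (precision, recall, f1)} for each class."""
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # predicted class gains a false positive
            fn[truth] += 1  # true class gains a false negative
    scores = {}
    for label in labels:
        p_den = tp[label] + fp[label]
        r_den = tp[label] + fn[label]
        precision = tp[label] / p_den if p_den else 0.0
        recall = tp[label] / r_den if r_den else 0.0
        pr_sum = precision + recall
        f1 = 2 * precision * recall / pr_sum if pr_sum else 0.0
        scores[label] = (precision, recall, f1)
    return scores


def macro_average(scores):
    """Unweighted mean of (precision, recall, f1) over classes (an assumption)."""
    n = len(scores)
    return tuple(sum(s[i] for s in scores.values()) / n for i in range(3))


# Hypothetical toy data, not taken from the paper.
y_true = ["terrorism", "terrorism", "offensive", "offensive", "neutral", "neutral"]
y_pred = ["terrorism", "offensive", "offensive", "offensive", "neutral", "neutral"]

scores = per_class_scores(y_true, y_pred)
macro_precision, macro_recall, macro_f1 = macro_average(scores)
print(accuracy(y_true, y_pred), macro_precision, macro_recall, macro_f1)
```

In this toy run one terrorism-related message is misclassified as offensive, which lowers recall for the terrorism class and precision for the offensive class while leaving the neutral class unaffected. A ROC curve additionally requires per-class probability scores rather than hard labels, so it is left out of this sketch.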