MADMAX: Browser-Based Malicious Domain Detection Through Extreme Learning Machine

Autor:	Tatsuya Takemura, Ju Chien Cheng, Rei Shimizu, Naoki Umeda, Kazuki Iwahana, Naoto Yanai, Nami Ashizawa, Yuichiro Chinen, Kodai Sato, Ryota Kawakami
Rok vydání:	2021
Předmět:	Source code General Computer Science Computer science media_common.quotation_subject Feature extraction 02 engineering and technology 010501 environmental sciences Machine learning computer.software_genre 01 natural sciences Domain (software engineering) extreme learning machine Permutation feature selection Server 0202 electrical engineering electronic engineering information engineering Selection (linguistics) malicious domain detection General Materials Science real-time training Throughput (business) 0105 earth and related environmental sciences Extreme learning machine media_common business.industry General Engineering TK1-9971 machine learning Browser application 020201 artificial intelligence & image processing Electrical engineering. Electronics. Nuclear engineering Artificial intelligence business computer
Zdroj:	IEEE Access, Vol 9, Pp 78293-78314 (2021)
ISSN:	2169-3536
Popis:	Fast and accurate malicious domain detection is an essential research theme to prevent cybercrime, and machine learning is an attractive approach for detecting unseen malicious domains in the past decade. In this paper, we present MADMAX (MAchine learning-baseD MAlicious domain eXhauster), a browser-based application leveraging extreme learning machine (ELM) for malicious domain detection. In contrast to the existing work of ELM-based domain detection, MADMAX newly introduces two methods, i.e., selection of optimized features to provide higher accuracy and throughput based on permutation importance and real-time training to retrain a model with an updated malicious dataset for continuous malicious domain detection. We demonstrate that MADMAX fairly outperforms the existing work with respect to accuracy and throughput by virtue of the selection of optimized features. Moreover, we also confirm a model with real-time training stably detects even unseen malicious domains, whereas accuracy of a model without the real-time training decreases due to the unseen domains. The source codes of MADMAX is publicly available via GitHub.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::48cf0e784d7a55dd4813b4b6071b7bc1 https://doi.org/10.1109/access.2021.3080456 Zobrazit plný text záznamu