MADMAX: Browser-Based Malicious Domain Detection Through Extreme Learning Machine

Autor: Tatsuya Takemura, Ju Chien Cheng, Rei Shimizu, Naoki Umeda, Kazuki Iwahana, Naoto Yanai, Nami Ashizawa, Yuichiro Chinen, Kodai Sato, Ryota Kawakami
Rok vydání: 2021
Předmět:
Source code
General Computer Science
Computer science
media_common.quotation_subject
Feature extraction
02 engineering and technology
010501 environmental sciences
Machine learning
computer.software_genre
01 natural sciences
Domain (software engineering)
extreme learning machine
Permutation
feature selection
Server
0202 electrical engineering
electronic engineering
information engineering

Selection (linguistics)
malicious domain detection
General Materials Science
real-time training
Throughput (business)
0105 earth and related environmental sciences
Extreme learning machine
media_common
business.industry
General Engineering
TK1-9971
machine learning
Browser application
020201 artificial intelligence & image processing
Electrical engineering. Electronics. Nuclear engineering
Artificial intelligence
business
computer
Zdroj: IEEE Access, Vol 9, Pp 78293-78314 (2021)
ISSN: 2169-3536
Popis: Fast and accurate malicious domain detection is an essential research theme to prevent cybercrime, and machine learning is an attractive approach for detecting unseen malicious domains in the past decade. In this paper, we present MADMAX (MAchine learning-baseD MAlicious domain eXhauster), a browser-based application leveraging extreme learning machine (ELM) for malicious domain detection. In contrast to the existing work of ELM-based domain detection, MADMAX newly introduces two methods, i.e., selection of optimized features to provide higher accuracy and throughput based on permutation importance and real-time training to retrain a model with an updated malicious dataset for continuous malicious domain detection. We demonstrate that MADMAX fairly outperforms the existing work with respect to accuracy and throughput by virtue of the selection of optimized features. Moreover, we also confirm a model with real-time training stably detects even unseen malicious domains, whereas accuracy of a model without the real-time training decreases due to the unseen domains. The source codes of MADMAX is publicly available via GitHub.
Databáze: OpenAIRE