Early Identification of Abused Domains in TLD through Passive DNS Applying Machine Learning Techniques
Autor: | Leandro Marcos Da Silva |
---|---|
Rok vydání: | 2022 |
Předmět: | |
Zdroj: | International Journal of Communication Networks and Information Security (IJCNIS). 14 |
ISSN: | 2073-607X 2076-0930 |
DOI: | 10.17762/ijcnis.v14i1.5256 |
Popis: | DNS is vital for the proper functioning of the Internet. However, users use this structure for domain registration and abuse. These domains are used as tools for these users to carry out the most varied attacks. Thus, early detection of abused domains prevents more people from falling into scams. In this work, an approach for identifying abused domains was developed using passive DNS collected from an authoritative DNS server TLD along with the data enriched through geolocation, thus enabling a global view of the domains. Therefore, the system monitors the domain’s first seven days of life after its first DNS query, in which two behavior checks are performed, the first with three days and the second with seven days. The generated models apply the machine learning algorithm LightGBM, and because of the unbalanced data, the combination of Cluster Centroids and K-Means SMOTE techniques were used. As a result, it obtained an average AUC of 0.9673 for the three-day model and an average AUC of 0.9674 for the seven-day model. Finally, the validation of three and seven days in a test environment reached a TPR of 0.8656 and 0.8682, respectively. It was noted that the system has a satisfactory performance for the early identification of abused domains and the importance of a TLD to identify these domains. |
Databáze: | OpenAIRE |
Externí odkaz: |