Development and Research of an Evolutionary Algorithm for the Formation of a Feature Space based on AutoML for Solving the Problem of Identifying Cyber Attacks

Autor: Lubov Zabrodina, Irina Bolodurina, Denis Parfenov, Arthur Zhigalov, Alexander Shukhman
Rok vydání: 2020
Předmět:
Zdroj: 2020 International Conference Engineering and Telecommunication (En&T).
DOI: 10.1109/ent50437.2020.9431251
Popis: Currently, the use of intelligent data analysis methods allows us to solve applied problems that go beyond simple machine learning and determines the development vector for the possibility to automate all stages of this process. This article presents an approach that allows you to automate the construction and selection of features of raw datasets. The developed method is a logical extension of the ExploreKit algorithm at the stage of generating new feature candidates and calculating meta-objects to represent the dataset and feature candidates. Also, our approach integrated the tree representation of transformations of the AutoFE feature space and methods of evolutionary optimization. Experimental studies on UNSW-NB15, CICDDoS2019, and APA-DDoS traffic datasets have shown that the evolutionary algorithm to form the feature space allows obtaining the necessary accuracy comparable to other algorithms. In addition, the evolutionary approach showed good performance in most cases due to the parallelization of calculations. On the CICDDoS dataset with the highest number of characteristics, the performance of evolutionary approach was 7.4% higher than the Feature Learning by Tree algorithm, but our approach was 13.9% lower than the Feature Learning by Deep Network algorithm. However, the accuracy of FDN algorithm was 2.2 % lower than the accuracy of the evolutionary approach. On the UNSW_NB15 and APA-DDoS datasets, the developed approach showed a decrease in performance, but its accuracy compared to other algorithms is 1.9% higher on average.
Databáze: OpenAIRE