Comparative Analysis of Machine Learning Methods Application for Financial Fraud Detection

Autor:	Alexander Menshchikov, Vladislav Perfilev, Denis Roenko, Maksim Zykin, Maksim Fedosenko
Jazyk:	angličtina
Rok vydání:	2022
Předmět:	big data fraud detection machine learning fraudulent transactions classification methods anti-fraud supervised learning ensemble learning classification performance Telecommunication TK5101-6720
Zdroj:	Proceedings of the XXth Conference of Open Innovations Association FRUCT, Vol 32, Iss 1, Pp 178-186 (2022)
Druh dokumentu:	article
ISSN:	2305-7254 2343-0737
DOI:	10.23919/FRUCT56874.2022.9953872
Popis:	This paper addresses the fraud detection problem in the context of Big Data used in remote banking systems. The paper aims to propose a new algorithm for automatic detection of fraudulent transactions using machine learning with a performance that allows to apply it in big data systems. The article identifies promising directions for optimizing the operation of methods for fraudulent transactions detection in anti-fraud systems. Architectural approaches to the operation of anti-fraud systems have been studied. Based on this, an architecture for illegal actions prediction in a near real-time mode was proposed. The research task of the article is to find the most suitable machine learning algorithm, with the least training and prediction time, demonstrating high classification performance. To achieve this goal, an analysis of the supervised and ensemble machine learning algorithms was made. The dataset was preprocessed for the experiment with SMOTE resampling and robust scaling techniques. The chosen methods were compared using different metrics: f1 score, AUC and time consumption for training and classification. As a result of a metrics comparison, it was found that multilayer perceptron (MLP) and boosting methods (Adaptive, Gradient, XGBoost) has the highest classification, but MLP outperforms boosting methods in terms of time consumption for classification. Thus, MLP was selected as the most appropriate algorithm for further integration to proposed Big Data architecture. Based on the data obtained during the experiments, the degree of their implementation in fraud detection systems was assessed and architecture for the anti-fraud detection system for big data was proposed.
Databáze:	Directory of Open Access Journals
Externí odkaz:	https://doaj.org/article/ef1d77a4d177471b8f6da93d3c61da26 Zobrazit plný text záznamu View record in DOAJ