A Model Based on LSTM Neural Networks to Identify Five Different Types of Malware

Autor:	Eduardo de O. Andrade, José Viterbo, Joris Guérin, Cristina Nader Vasconcelos, Flavia Bernardini
Rok vydání:	2019
Předmět:	Artificial neural network business.industry Computer science Deep learning 020206 networking & telecommunications 02 engineering and technology computer.software_genre Machine learning Multiclass classification ComputingMethodologies_PATTERNRECOGNITION Recurrent neural network 0202 electrical engineering electronic engineering information engineering General Earth and Planetary Sciences Malware 020201 artificial intelligence & image processing Artificial intelligence business computer General Environmental Science
Zdroj:	KES
ISSN:	1877-0509
DOI:	10.1016/j.procs.2019.09.173
Popis:	Identifying malware has always been a great challenge. Much money and time has been invested by companies and governments to mitigate the impact of these threats. Nowadays, with the increasing amount of data available, it is possible to use more precise classification techniques. However, most large datasets that include malicious and non-malicious softwares are not public, which hinders the quest for solutions based in technologies that rely on the availability of large amounts of data, such as deep learning. To overcome this limitation, this article introduces a new large dataset for malware classification, which was made publicly available. We then propose a model to train a multiclass classification recurrent neural network (RNN), more specifically a long short-term memory neural network (LSTM) on our dataset. This model for analyzing unstructured malware data is then tested on unseen programs and the accuracy obtained reaches 67.60%, including six classes with five different types of malware.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::a476ff7d09858885d3e5b52c81b4e48e https://doi.org/10.1016/j.procs.2019.09.173 Zobrazit plný text záznamu