Flattened Data in Convolutional Neural Networks
Autor: | Chih-Ta Lin, Shih-Hao Hung, Wan-Ting Yeh, Chih-Wei Yeh |
---|---|
Rok vydání: | 2016 |
Předmět: |
Spatial correlation
Computer science business.industry Dimensionality reduction 020207 software engineering 02 engineering and technology Input format Machine learning computer.software_genre Convolutional neural network Multilayer perceptron 0202 electrical engineering electronic engineering information engineering Neural system Malware 020201 artificial intelligence & image processing Artificial intelligence Android (operating system) business computer |
Zdroj: | RACS |
DOI: | 10.1145/2987386.2987406 |
Popis: | Convolutional Neural Networks (CNNs) are very powerful variants of multilayer perceptron models inspired by human's brain neural system to reveal local, spatial correlation in a series of data. While CNNs are popularly used for image recognition nowadays, it is also possible to apply CNNs in other areas, for example, detection of malicious software. In this paper, we show how CNNs may be used to improve the classification of malicious software due to the high-level feature abstraction and equal-variance property against noises. Taking advantages of convolution kernels, CNNs are naturally born for pattern recognition on images only. For this application, we introduce a new transformation technique which converts a series of event logs into flattened data with two-dimensional features so that CNNs can be trained to detect malicious behaviors effectively. With the combination property and the proposed flattened input format, CNN can perform a k-skip-n-gram dimensionality reduction which learns more flexible and complex patterns comparing to the traditional solutions. Our preliminary results show that our latest CNNs-based malware detection engine reaches 93.012% prediction accuracy and 12.9% FNR under 32,000 samples of a training set. To our knowledge, this is the first paper discussing the application and effectiveness of CNNs on malware detection. |
Databáze: | OpenAIRE |
Externí odkaz: |