Statistical network protocol identification with unknown pattern extraction

Autor:	Yu Wang, Hanxiao Xue, Waixi Liu, Yang Liu
Rok vydání:	2019
Předmět:	Protocol (science) Scheme (programming language) Network security business.industry Computer science 020206 networking & telecommunications Context (language use) 02 engineering and technology computer.software_genre Identification (information) Statistical classification Traffic classification 0202 electrical engineering electronic engineering information engineering 020201 artificial intelligence & image processing Data mining Electrical and Electronic Engineering Communications protocol business computer computer.programming_language
Zdroj:	Annals of Telecommunications. 74:473-482
ISSN:	1958-9395 0003-4347
DOI:	10.1007/s12243-019-00704-y
Popis:	Network traffic classification is an enabling technique for network security and management for both traditional networks and emerging networks such as Internet of Things. Due to the decreasing effectiveness of traditional port-based and payload-based methods, lots of research attentions are devoted to an alternative approach based on flow and packet-level traffic characteristics. A variety of statistical classification schemes are proposed in this context, but most of them embody an implicit assumption that all protocols are known in advance and well presented in the training data. This assumption is unrealistic because real-world networks constantly witness emerging traffic patterns and protocols that are previously unknown. In this paper, we revisit the problem by proposing a learning scheme with unknown pattern extraction for statistical protocol identification. The scheme is designed with a more realistic setting, in which we assume that the training data only consists of labeled samples from a limited number of protocols, and the goal is to identify these known patterns out of arbitrary traffic mixture of both known and unknown protocols. Our experiments based on real-world traffic show that the proposed scheme outperforms previous approaches by accurately identifying both known and unknown protocols.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::125775d74c73730d50d75c252b73b177 https://doi.org/10.1007/s12243-019-00704-y Zobrazit plný text záznamu Full text from SpringerLink