Application Research of File Fingerprint Identification Detection Based on a Network Security Protection System

Autor: Sheng Shen, Yingwei Li, Lina Yu, Chunwei Wang, Fang Hou, Huixian Chang
Rok vydání: 2020
Předmět:
Zdroj: Wireless Communications and Mobile Computing, Vol 2020 (2020)
ISSN: 1530-8677
1530-8669
Popis: A DLP (data loss prevention) system usually arranges network monitors at the network boundary to perform network traffic capture, file parsing, and strategy matching procedures. Strategy matching is a key process to prevent corporate secret-related documents from leaking. This paper adopts the document fingerprint similarity detection method based on the SimHash principle and customizes the KbS (Keyword-based SimHash) fingerprint, PbS (Paragraph-based SimHash) fingerprint, and SoP (SimHash of Paragraph) fingerprint, three different feature extraction SimHash algorithms for strategy matching to detect. The parsed unstructured data is stored as a file type in.txt format, and then a file fingerprint is generated. Matching the established sensitive document library to calculate the Hamming distance between the fingerprints, the Hamming distance values under different modification degrees are summarized. The experimental results reveal that the hybrid algorithmic strategy matching rules with different levels and accuracy are established. This paper has a reference role for the leakage prevention research of enterprise sensitive data.
Databáze: OpenAIRE