Using data visualization technique to detect sensitive information re-identification problem of real open dataset
Autor: | Chiun-How Kao, Yu-Ting Kuang, Chuan-Kai Yang, Yu-Feng Chu, Chih-Hung Hsieh |
---|---|
Rok vydání: | 2017 |
Předmět: |
business.industry
Computer science 020207 software engineering 02 engineering and technology computer.software_genre Data science Visualization Information sensitivity Open data Data visualization Hardware and Architecture 0202 electrical engineering electronic engineering information engineering 020201 artificial intelligence & image processing The Internet Data mining business Personally identifiable information computer Software Record linkage Vulnerability (computing) |
Zdroj: | Journal of Systems Architecture. 80:85-91 |
ISSN: | 1383-7621 |
DOI: | 10.1016/j.sysarc.2017.09.009 |
Popis: | With plenty valuable information, open data are often deemed as great assets to academia or industry. In spite of some de-identification processing that most of data owners will perform before releasing the data, the more datasets are opened to public, the more likely personal privacy will be exposed. According to previous real case studies, even though the personally identifiable information has been de-identified, sensitive personal information could still be uncovered by heterogeneous or cross-domain data joining operations. The involved privacy re-identification processes are usually too complicated or obscure to be realized by data owners, not to mention that this problem will be more severe as the scale of data will get larger and larger. For preventing the leakage of sensitive information, this paper shows how to use a novel visualization analysis tool for open data de-identification (ODD Visualizer) to verify whether there exists sensitive information leakage problem in the target datasets. The high effectiveness that the ODD Visualizer can provide mainly comes from implementing a scalable computing platform as well as developing an efficient data visualization technique. Our demonstrations show that the ODD Visualizer can indeed uncover real vulnerability of record linkage attacks among open datasets available on the Internet. |
Databáze: | OpenAIRE |
Externí odkaz: |