National Preprint Platform

Research on data masking technology based on personal sensitive information

Abstract

Data is a core factor of production in the new era, and it flows, is shared, and is opened up across industries. Because data is non-rivalrous, security risks such as leakage during this flow and opening are becoming increasingly prominent. To address the poor extensibility, low accuracy, and low performance of existing sensitive data identification methods, this paper proposes a sensitive data identification method based on a classify-first, identify-second architecture (abbreviated SDICBIA). Experimental results show that, compared with traditional identification based on regular-expression matching, SDICBIA reaches an identification accuracy of 97.14% and improves identification performance by about 70%. In addition, this paper designs a data masking system that supports automatic, intelligent identification of sensitive data and automated data masking, and that can be used to prevent leakage of personal sensitive information. The system comprises three functional modules: a synchronization and adaptation management module for multi-source heterogeneous data sources, an automatic sensitive data identification module, and a data masking module. The data masking system is designed and implemented for structured data scenarios.
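The abstract does not describe SDICBIA's internals, so the following is only a minimal Python sketch of the general idea of classifying first and identifying second on one structured column, followed by masking. Every name here (`coarse_class`, `DETECTORS`, `identify_column`, `mask_value`, `mask_column`), the coarse classes, the two patterns, the 0.8 threshold, and the masking rules are illustrative assumptions, not the paper's method.

```python
import re
from collections import Counter
from typing import List, Optional

# Stage 2 detectors, grouped by the coarse class that must match first.
# Class names and patterns are illustrative assumptions, not the paper's rules.
DETECTORS = {
    "numeric": [("phone", re.compile(r"1[3-9]\d{9}"))],
    "at_sign": [("email", re.compile(r"[^@\s]+@[^@\s]+\.[^@\s]+"))],
    "text": [],
}


def coarse_class(value: str) -> str:
    """Stage 1: cheap classification of a value by its character shape."""
    if value.isdigit():
        return "numeric"
    if "@" in value:
        return "at_sign"
    return "text"


def identify_column(values: List[str], threshold: float = 0.8) -> Optional[str]:
    """Return a sensitive-data label for a column, or None if nothing fits."""
    if not values:
        return None
    # Classify first: take the most common coarse class in the column.
    majority, _ = Counter(coarse_class(v) for v in values).most_common(1)[0]
    # Identify second: run only the detectors registered for that class.
    for label, pattern in DETECTORS[majority]:
        hits = sum(1 for v in values if pattern.fullmatch(v))
        if hits / len(values) >= threshold:
            return label
    return None


def mask_value(label: str, value: str) -> str:
    """Apply a simple, hypothetical masking rule for the given label."""
    if label == "phone":
        # Keep the first 3 and last 4 digits.
        return value[:3] + "****" + value[-4:]
    if label == "email":
        # Keep the first character of the local part and the full domain.
        local, _, domain = value.partition("@")
        return local[:1] + "***@" + domain
    return value


def mask_column(values: List[str]) -> List[str]:
    """Identify the column, then mask every value with that label's rule."""
    label = identify_column(values)
    if label is None:
        return list(values)
    # Note: values that did not match the pattern get the same rule applied.
    return [mask_value(label, v) for v in values]
```

For example, `mask_column(["13812345678", "13900001111"])` returns `['138****5678', '139****1111']`, while a column of ordinary words is returned unchanged. In this sketch the first stage only narrows the set of patterns that are tried on a column; whether that matches how SDICBIA achieves its reported gains is not stated in the abstract.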

Zhang Yexing (张业兴)

Computing technology, computer technology

data masking; data encryption; data security; sensitive data

Zhang Yexing. Research on data masking technology based on personal sensitive information [EB/OL]. (2021-11-05) [2025-06-21]. http://www.paper.edu.cn/releasepaper/content/202111-13.
