|国家预印本平台
首页|垃圾邮件的概念漂移及过滤技术研究

垃圾邮件的概念漂移及过滤技术研究

Research on concept drift and filtering in spam ecosystem

中文摘要英文摘要

垃圾邮件与垃圾邮件过滤构成了相互博弈的生态系统。一个长期有效的垃圾邮件过滤技术必须能够自适应的应对垃圾邮件随时间和用户偏好而产生的各种变化,这种变化在机器学习领域中被称为概念漂移。提出双级别的概念漂移检测算法,监视已有的垃圾邮件过滤模型在对邮件分类时是否产生了持续的分类错误,进而对概念漂移进行识别。针对由用户偏好引起的垃圾邮件概念范畴变化,基于本体论提出邮件数字指纹与概念子类别之间的关联强度和隶属度的算法。通过对比实验,验证了所提方法在垃圾邮件概念漂移问题上的有效性。

Spam and spam filtering constitutes a game spam ecosystems. A long-term effective spam filtering technology should self-adaptive response to kinds of spam variations generated with time and user preferences, and which is known as concept drift in machine learning area. The dual-level concept drift detection algorithm was proposed to discern concept drift in which sustained misclassification was monitored when email classification. Regarding the spam concept scope changes caused by user preferences, the association and subjection strength algorithm was proposed between email fingerprints and concept subcategories based ontology. The proposed method was proved effective to handle the concept drift problem in spam filtering by comparing experiment.

殷爱茹、师文轩

计算技术、计算机技术

机器学习垃圾邮件过滤概念漂移数字指纹

machine learningspam filteringconcept driftfingerprinting

殷爱茹,师文轩.垃圾邮件的概念漂移及过滤技术研究[EB/OL].(2014-06-12)[2025-08-17].http://www.paper.edu.cn/releasepaper/content/201406-178.点此复制

评论