|国家预印本平台
首页|数据流集成分类算法综述

数据流集成分类算法综述

中文摘要英文摘要

当前,数据流分类算法的潮流是集成分类算法,因为集成算法提供了比单分类算法更好的性能和更突出的表现。同时在现实世界的实际应用中容易部署,对概念漂移有快速的适应性和恢复性,而且在类不平衡问题的处理中也具有最佳的分类性能。详细介绍了国内外集成分类算法,对集成分类算法的两个部分(基分类器组合和动态更新集成模型)进行了详细综述,明确区分不同集成算法的优缺点,对比算法和实验数据集。并且提出进一步的研究方向和考虑的解决办法。

urrently, the trend of data stream classification algorithms is to ensemble classification algorithms. Because the ensemble algorithm provides better performance and more outstanding performance than the single classification algorithm. At the same time, it is easy to deploy in practical applications in the real world, has rapid adaptability and recovery to concept drift, and has the best classification performance in the processing of class imbalance problems. Based on the outstanding features and performance of the above ensemble classification algorithm, it has won extensive research by scholars at home and abroad. This paper introduces the ensemble classification algorithm at home and abroad in detail. The two parts of the ensemble classification algorithm (base classifier combination and dynamic update ensemble model) are reviewed in detail, and the advantages and disadvantages of different integration algorithms, comparison algorithm and experimental data set are clearly distinguished. The paper proposed further research Directions and considerations.

许冠英、王少峰、韩萌、贾涛

10.12074/201811.00172V1

计算技术、计算机技术

数据流分类集成学习概念漂移

许冠英,王少峰,韩萌,贾涛.数据流集成分类算法综述[EB/OL].(2018-11-29)[2025-08-02].https://chinaxiv.org/abs/201811.00172.点此复制

评论