Measures of Overlapping Multivariate Gaussian Clusters in Unsupervised Online Learning
Measures of Overlapping Multivariate Gaussian Clusters in Unsupervised Online Learning
In this paper, we propose a new measure for detecting overlap in multivariate Gaussian clusters. The aim of online learning from data streams is to create clustering, classification, or regression models that can adapt over time based on the conceptual drift of streaming data. In the case of clustering, this can result in a large number of clusters that may overlap and should be merged. Commonly used distribution dissimilarity measures are not adequate for determining overlapping clusters in the context of online learning from streaming data due to their inability to account for all shapes of clusters and their high computational demands. Our proposed dissimilarity measure is specifically designed to detect overlap rather than dissimilarity and can be computed faster compared to existing measures. Our method is several times faster than compared methods and is capable of detecting overlapping clusters while avoiding the merging of orthogonal clusters.
Miha Ožbot、Igor Škrjanc
计算技术、计算机技术
Miha Ožbot,Igor Škrjanc.Measures of Overlapping Multivariate Gaussian Clusters in Unsupervised Online Learning[EB/OL].(2025-08-21)[2025-09-02].https://arxiv.org/abs/2508.15444.点此复制
评论