|国家预印本平台
首页|Structure of Classifier Boundaries: Case Study for a Naive Bayes Classifier

Structure of Classifier Boundaries: Case Study for a Naive Bayes Classifier

Structure of Classifier Boundaries: Case Study for a Naive Bayes Classifier

来源:Arxiv_logoArxiv
英文摘要

Whether based on models, training data or a combination, classifiers place (possibly complex) input data into one of a relatively small number of output categories. In this paper, we study the structure of the boundary--those points for which a neighbor is classified differently--in the context of an input space that is a graph, so that there is a concept of neighboring inputs, The scientific setting is a model-based naive Bayes classifier for DNA reads produced by Next Generation Sequencers. We show that the boundary is both large and complicated in structure. We create a new measure of uncertainty, called Neighbor Similarity, that compares the result for a point to the distribution of results for its neighbors. This measure not only tracks two inherent uncertainty measures for the Bayes classifier, but also can be implemented, at a computational cost, for classifiers without inherent measures of uncertainty.

Zac Bowen、Adam A. Porter、Alan F. Karr

计算技术、计算机技术

Zac Bowen,Adam A. Porter,Alan F. Karr.Structure of Classifier Boundaries: Case Study for a Naive Bayes Classifier[EB/OL].(2022-12-08)[2025-08-02].https://arxiv.org/abs/2212.04382.点此复制

评论