|国家预印本平台
首页|Ambiguous Online Learning

Ambiguous Online Learning

Ambiguous Online Learning

来源:Arxiv_logoArxiv
英文摘要

We propose a new variant of online learning that we call "ambiguous online learning". In this setting, the learner is allowed to produce multiple predicted labels. Such an "ambiguous prediction" is considered correct when at least one of the labels is correct, and none of the labels are "predictably wrong". The definition of "predictably wrong" comes from a hypothesis class in which hypotheses are also multi-valued. Thus, a prediction is "predictably wrong" if it's not allowed by the (unknown) true hypothesis. In particular, this setting is natural in the context of multivalued dynamical systems, recommendation algorithms and lossless compression. It is also strongly related to so-called "apple tasting". We show that in this setting, there is a trichotomy of mistake bounds: up to logarithmic factors, any hypothesis class has an optimal mistake bound of either Theta(1), Theta(sqrt(N)) or N.

Vanessa Kosoy

计算技术、计算机技术

Vanessa Kosoy.Ambiguous Online Learning[EB/OL].(2025-06-24)[2025-07-16].https://arxiv.org/abs/2506.19810.点此复制

评论