|国家预印本平台
首页|Semantics at an Angle: When Cosine Similarity Works Until It Doesn't

Semantics at an Angle: When Cosine Similarity Works Until It Doesn't

Semantics at an Angle: When Cosine Similarity Works Until It Doesn't

来源:Arxiv_logoArxiv
英文摘要

Cosine similarity has become a standard metric for comparing embeddings in modern machine learning. Its scale-invariance and alignment with model training objectives have contributed to its widespread adoption. However, recent studies have revealed important limitations, particularly when embedding norms carry meaningful semantic information. This informal article offers a reflective and selective examination of the evolution, strengths, and limitations of cosine similarity. We highlight why it performs well in many settings, where it tends to break down, and how emerging alternatives are beginning to address its blind spots. We hope to offer a mix of conceptual clarity and practical perspective, especially for quantitative scientists who think about embeddings not just as vectors, but as geometric and philosophical objects.

Kisung You

计算技术、计算机技术

Kisung You.Semantics at an Angle: When Cosine Similarity Works Until It Doesn't[EB/OL].(2025-04-22)[2025-05-05].https://arxiv.org/abs/2504.16318.点此复制

评论