|国家预印本平台
首页|Heterogeneous co-occurrence embedding for visual information exploration

Heterogeneous co-occurrence embedding for visual information exploration

Heterogeneous co-occurrence embedding for visual information exploration

来源:Arxiv_logoArxiv
英文摘要

This paper proposes an embedding method for co-occurrence data aimed at visual information exploration. We consider cases where co-occurrence probabilities are measured between pairs of elements from heterogeneous domains. The proposed method maps these heterogeneous elements into corresponding two-dimensional latent spaces, enabling visualization of asymmetric relationships between the domains. The key idea is to embed the elements in a way that maximizes their mutual information, thereby preserving the original dependency structure as much as possible. This approach can be naturally extended to cases involving three or more domains, using a generalization of mutual information known as total correlation. For inter-domain analysis, we also propose a visualization method that assigns colors to the latent spaces based on conditional probabilities, allowing users to explore asymmetric relationships interactively. We demonstrate the utility of the method through applications to an adjective-noun dataset, the NeurIPS dataset, and a subject-verb-object dataset, showcasing both intra- and inter-domain analysis.

Takuro Ishida、Tetsuo Furukawa

计算技术、计算机技术

Takuro Ishida,Tetsuo Furukawa.Heterogeneous co-occurrence embedding for visual information exploration[EB/OL].(2025-08-25)[2025-09-05].https://arxiv.org/abs/2508.17663.点此复制

评论