|国家预印本平台
首页|An Adaptive Data-Resilient Multi-Modal Framework for Hierarchical Multi-Label Book Genre Identification

An Adaptive Data-Resilient Multi-Modal Framework for Hierarchical Multi-Label Book Genre Identification

An Adaptive Data-Resilient Multi-Modal Framework for Hierarchical Multi-Label Book Genre Identification

来源:Arxiv_logoArxiv
英文摘要

Identifying the finer details of a book's genres enhances user experience by enabling efficient book discovery and personalized recommendations, ultimately improving reader engagement and satisfaction. It also provides valuable insights into market trends and consumer preferences, allowing publishers and marketers to make data-driven decisions regarding book production and marketing strategies. While traditional book genre classification methods primarily rely on review data or textual analysis, incorporating additional modalities, such as book covers, blurbs, and metadata, can offer richer context and improve prediction accuracy. However, the presence of incomplete or noisy information across these modalities presents a significant challenge. This paper introduces IMAGINE (Intelligent Multi-modal Adaptive Genre Identification NEtwork), a framework designed to address these complexities. IMAGINE extracts robust feature representations from multiple modalities and dynamically selects the most informative sources based on data availability. It employs a hierarchical classification strategy to capture genre relationships and remains adaptable to varying input conditions. Additionally, we curate a hierarchical genre classification dataset that structures genres into a well-defined taxonomy, accommodating the diverse nature of literary works. IMAGINE integrates information from multiple sources and assigns multiple genre labels to each book, ensuring a more comprehensive classification. A key feature of our framework is its resilience to incomplete data, enabling accurate predictions even when certain modalities, such as text, images, or metadata, are missing or incomplete. Experimental results show that IMAGINE outperformed existing baselines in genre classification accuracy, particularly in scenarios with insufficient modality-specific data.

Chandranath Adak、Soumi Chattopadhyay、Arjab Roy、Prolay Mallick、Suraj Kumar、Ayush Vikas Daga、Adarsh Wase、Utsav Kumar Nareti

计算技术、计算机技术

Chandranath Adak,Soumi Chattopadhyay,Arjab Roy,Prolay Mallick,Suraj Kumar,Ayush Vikas Daga,Adarsh Wase,Utsav Kumar Nareti.An Adaptive Data-Resilient Multi-Modal Framework for Hierarchical Multi-Label Book Genre Identification[EB/OL].(2025-05-05)[2025-05-28].https://arxiv.org/abs/2505.03839.点此复制

评论