多模态营养知识图谱构建
饮食是人们生活中的关键一环。近年来,随着生活水平的提高,人们越来越注重饮食的健康与个性化。为了精准、有效、直观地为不同人群提供营养和饮食建议,构建多模态营养知识图谱是十分必要的。本文结合营养学书籍、文献和网站中的营养数据,构建了包含食物、营养、疾病等实体的多模态营养知识图谱。借鉴OneRel模型,完成中文实体关系联合抽取。通过利用感知哈希(pHash)算法对获取的食物图像数据进行过滤,使用RoBERTa-ResNet模型分别学习文本和图像数据特征,并文本、图像特征向量进行拼接得到融合向量,通过加入全连接层学习模态间的表层特征,辅助构建多模态知识图谱。最后,利用Neo4j图数据库对多模态营养知识图谱进行存储和可视化展示。本文提出的跨模态领域知识图谱构建方法构建的营养多模态知识图谱不仅能系统化地整合营养领域多模态知识,实现良好的可视化查询,也是智能问答、营养推荐系统等下游任务的底层支撑。
Diet is a crucial aspect of peoples lives. In recent years, with the improvement of living standards, people have increasingly focused on the health and personalization of their diets. To provide precise, effective, and intuitive nutritional and dietary recommendations for different populations, it is essential to construct a multimodal nutrition knowledge graph. This paper constructs a multimodal nutrition knowledge graph that includes entities such as food, nutrition, and diseases by integrating nutritional data from nutrition books, literature, and websites. By referencing the OneRel model, joint extraction of Chinese entity relationships is completed. The perceptual hash (pHash) algorithm is used to filter the acquired food image data, and the RoBERTa-ResNet model is employed to learn the features of text and image data separately. The text and image feature vectors are concatenated to form a fused vector, and a fully connected layer is added to learn the superficial features between modalities, which aids in the construction of the multimodal knowledge graph. Finally, the Neo4j graph database is utilized to store and visually display the multimodal nutrition knowledge graph. The multimodal nutrition knowledge graph constructed using the cross-modal knowledge graph construction method proposed in this paper not only systematically integrates multimodal knowledge in the field of nutrition and enables good visual query capabilities but also serves as the underlying support for downstream tasks such as intelligent question answering and nutrition recommendation systems.
车美龄、南嘉乐、林建海、高东平
医学研究方法生物科学研究方法、生物科学研究技术计算技术、计算机技术
多模态知识图谱知识表示健康饮食
Multimodal knowledge graphKnowledge representationHealthy diet
车美龄,南嘉乐,林建海,高东平.多模态营养知识图谱构建[EB/OL].(2025-05-22)[2025-08-02].https://www.biomedrxiv.org.cn/article/doi/bmr.202505.00042.点此复制
评论