|国家预印本平台
首页|The Medical Metaphors Corpus (MCC)

The Medical Metaphors Corpus (MCC)

The Medical Metaphors Corpus (MCC)

来源:Arxiv_logoArxiv
英文摘要

Metaphor is a fundamental cognitive mechanism that shapes scientific understanding, enabling the communication of complex concepts while potentially constraining paradigmatic thinking. Despite the prevalence of figurative language in scientific discourse, existing metaphor detection resources primarily focus on general-domain text, leaving a critical gap for domain-specific applications. In this paper, we present the Medical Metaphors Corpus (MCC), a comprehensive dataset of 792 annotated scientific conceptual metaphors spanning medical and biological domains. MCC aggregates metaphorical expressions from diverse sources including peer-reviewed literature, news media, social media discourse, and crowdsourced contributions, providing both binary and graded metaphoricity judgments validated through human annotation. Each instance includes source-target conceptual mappings and perceived metaphoricity scores on a 0-7 scale, establishing the first annotated resource for computational scientific metaphor research. Our evaluation demonstrates that state-of-the-art language models achieve modest performance on scientific metaphor detection, revealing substantial room for improvement in domain-specific figurative language understanding. MCC enables multiple research applications including metaphor detection benchmarking, quality-aware generation systems, and patient-centered communication tools.

Anna Sofia Lippolis、Andrea Giovanni Nuzzolese、Aldo Gangemi

医药卫生理论医学研究方法生物科学理论、生物科学方法生物科学研究方法、生物科学研究技术

Anna Sofia Lippolis,Andrea Giovanni Nuzzolese,Aldo Gangemi.The Medical Metaphors Corpus (MCC)[EB/OL].(2025-08-11)[2025-08-24].https://arxiv.org/abs/2508.07993.点此复制

评论