Understanding discrepancies in the coverage of OpenAlex: the case of China

Source: arXiv
Abstract

Citation indexes play a crucial role in understanding how science is produced, disseminated, and used. However, these databases often face a critical trade-off: those offering extensive and high-quality coverage are typically proprietary, whereas publicly accessible datasets frequently exhibit fragmented coverage and inconsistent data quality. OpenAlex was developed to address this challenge, providing a freely available database with broad open coverage and a particular emphasis on non-English-speaking countries. Yet few studies have assessed the quality of the OpenAlex dataset. This paper assesses OpenAlex's coverage of China's papers, which shows an abnormal trend, and compares it with that of other countries whose main language is not English. Our analysis reveals that while OpenAlex increases the coverage of China's publications, primarily those disseminated by a national database, this coverage is incomplete and discontinuous when compared to other countries' records in the database. We observe similar issues in other non-English-speaking countries, with coverage varying across regions. These findings indicate that although OpenAlex expands coverage of research outputs, continuity issues persist and disproportionately affect certain countries. We emphasize the need for researchers to use OpenAlex data cautiously, being mindful of its potential limitations in cross-national analyses.

Mengxue Zheng, Lili Miao, Yi Bu, Vincent Larivière

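Subjects: information and knowledge dissemination; science and scientific research; natural science research methods; information science and information technology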

Mengxue Zheng, Lili Miao, Yi Bu, Vincent Larivière. Understanding discrepancies in the coverage of OpenAlex: the case of China [EB/OL]. (2025-07-28) [2025-08-10]. https://arxiv.org/abs/2507.19302.
