Presenting a classifier to detect research contributions in OpenAlex
Presenting a classifier to detect research contributions in OpenAlex
This paper introduces a document type classifier with the purpose to optimise the distinction between research and non-research journal publications in OpenAlex. Based on open metadata, the classifier can detect non-research or editorial content within a set of classified articles and reviews (e.g. paratexts, abstracts, editorials, letters). The classifier achieves an F1-score of 0,95, indicating a potential improvement in the data quality of bibliometric research in OpenAlex when applying the classifier on real data. In total, 4.589.967 out of 42.701.863 articles and reviews could be reclassified as non-research contributions by the classifier, representing a share of 10,75%
Nick Haupka
计算技术、计算机技术
Nick Haupka.Presenting a classifier to detect research contributions in OpenAlex[EB/OL].(2025-07-30)[2025-08-06].https://arxiv.org/abs/2507.22479.点此复制
评论