
Author Name Disambiguation in Bibliographic Databases: A Survey

Source: arXiv
Abstract

Entity resolution has been a challenging and active research area in the field of Information Systems over the last decade. Author Name Disambiguation (AND) in Bibliographic Databases (BD) such as DBLP, Citeseer, and Scopus is a specialized field of entity resolution. Given many citations of underlying authors, the AND task is to find which citations belong to the same author. In this survey, we start with three basic AND problems, followed by the need for a solution and the challenges involved. A generic five-step framework is provided for handling AND issues. The steps are: (1) preparation of the dataset, (2) selection of publication attributes, (3) selection of similarity metrics, (4) selection of models, and (5) clustering performance evaluation. Categorization and elaboration of similarity metrics and methods are also provided. Finally, future directions and recommendations are given for this dynamic area of research.
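
As a rough illustration of how the five steps in the abstract might fit together, here is a minimal Python sketch. It is not the survey's method: the toy records, the choice of coauthors and venue as attributes, the Jaccard metric, the 0.3 threshold, the connected-components grouping, and all function names are assumptions made only for this example.

```python
from itertools import combinations

# Step 1: dataset preparation. Hypothetical toy citations that all share the
# ambiguous name "A. Kumar"; "label" is an assumed gold-standard author id.
CITATIONS = [
    {"coauthors": {"B. Lee", "C. Wong"}, "venue": "KDD", "label": "kumar_1"},
    {"coauthors": {"B. Lee"}, "venue": "KDD", "label": "kumar_1"},
    {"coauthors": {"D. Park", "E. Silva"}, "venue": "ICSE", "label": "kumar_2"},
    {"coauthors": {"E. Silva"}, "venue": "ICSE", "label": "kumar_2"},
]


def features(citation):
    """Step 2: pick publication attributes (here: coauthors plus venue)."""
    return citation["coauthors"] | {citation["venue"]}


def jaccard(a, b):
    """Step 3: similarity metric (Jaccard overlap of two attribute sets)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def cluster(citations, threshold=0.3):
    """Step 4: model. Link pairs whose similarity reaches the threshold and
    return one cluster id per citation (connected components, union-find)."""
    parent = list(range(len(citations)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path compression
            i = parent[i]
        return i

    for i, j in combinations(range(len(citations)), 2):
        if jaccard(features(citations[i]), features(citations[j])) >= threshold:
            parent[find(i)] = find(j)
    return [find(i) for i in range(len(citations))]


def pairwise_f1(predicted, gold):
    """Step 5: evaluation. Pairwise F1 over all citation pairs."""
    pairs = list(combinations(range(len(gold)), 2))
    pred_same = {p for p in pairs if predicted[p[0]] == predicted[p[1]]}
    gold_same = {p for p in pairs if gold[p[0]] == gold[p[1]]}
    tp = len(pred_same & gold_same)
    precision = tp / len(pred_same) if pred_same else 0.0
    recall = tp / len(gold_same) if gold_same else 0.0
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0


predicted = cluster(CITATIONS)
gold = [c["label"] for c in CITATIONS]
print(predicted, pairwise_f1(predicted, gold))
```

On this toy data the two KDD records and the two ICSE records fall into separate clusters. Real bibliographic data would need richer attributes and a more careful model than a single fixed threshold.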

Tehmina Amjad, Muhammad Shoaib, Ali Daud

Computing Technology, Computer Technology

Tehmina Amjad, Muhammad Shoaib, Ali Daud. Author Name Disambiguation in Bibliographic Databases: A Survey [EB/OL]. (2020-04-14) [2025-07-23]. https://arxiv.org/abs/2004.06391.
