首页|ConExion: Concept Extraction with Large Language Models

ConExion: Concept Extraction with Large Language Models

来源：

英文摘要

In this paper, an approach for concept extraction from documents using pre-trained large language models (LLMs) is presented. Compared with conventional methods that extract keyphrases summarizing the important information discussed in a document, our approach tackles a more challenging task of extracting all present concepts related to the specific domain, not just the important ones. Through comprehensive evaluations of two widely used benchmark datasets, we demonstrate that our method improves the F1 score compared to state-of-the-art techniques. Additionally, we explore the potential of using prompts within these models for unsupervised concept extraction. The extracted concepts are intended to support domain coverage evaluation of ontologies and facilitate ontology learning, highlighting the effectiveness of LLMs in concept extraction tasks. Our source code and datasets are publicly available at https://github.com/ISE-FIZKarlsruhe/concept_extraction.

作者：Ebrahim Norouzi、Sven Hertling、Harald Sack

作者单位：

学科分类：计算技术、计算机技术

推荐引用：Ebrahim Norouzi,Sven Hertling,Harald Sack.ConExion: Concept Extraction with Large Language Models[EB/OL].(2025-04-17)[2025-07-16].https://arxiv.org/abs/2504.12915.点此复制

ConExion: Concept Extraction with Large Language Models

ConExion: Concept Extraction with Large Language Models

评论