|国家预印本平台
首页|Causal Graph Discovery with Retrieval-Augmented Generation based Large Language Models

Causal Graph Discovery with Retrieval-Augmented Generation based Large Language Models

Causal Graph Discovery with Retrieval-Augmented Generation based Large Language Models

来源:Arxiv_logoArxiv
英文摘要

Causal graph recovery is traditionally done using statistical estimation-based methods or based on individual's knowledge about variables of interests. They often suffer from data collection biases and limitations of individuals' knowledge. The advance of large language models (LLMs) provides opportunities to address these problems. We propose a novel method that leverages LLMs to deduce causal relationships in general causal graph recovery tasks. This method leverages knowledge compressed in LLMs and knowledge LLMs extracted from scientific publication database as well as experiment data about factors of interest to achieve this goal. Our method gives a prompting strategy to extract associational relationships among those factors and a mechanism to perform causality verification for these associations. Comparing to other LLM-based methods that directly instruct LLMs to do the highly complex causal reasoning, our method shows clear advantage on causal graph quality on benchmark datasets. More importantly, as causality among some factors may change as new research results emerge, our method show sensitivity to new evidence in the literature and can provide useful information for updating causal graphs accordingly.

Chen Wang、Yipeng Zhang、Yuzhe Zhang、Lina Yao、Yidong Gan

自然科学研究方法信息科学、信息技术计算技术、计算机技术

Chen Wang,Yipeng Zhang,Yuzhe Zhang,Lina Yao,Yidong Gan.Causal Graph Discovery with Retrieval-Augmented Generation based Large Language Models[EB/OL].(2024-02-23)[2025-08-05].https://arxiv.org/abs/2402.15301.点此复制

评论