|国家预印本平台
首页|SQUiD: Synthesizing Relational Databases from Unstructured Text

SQUiD: Synthesizing Relational Databases from Unstructured Text

SQUiD: Synthesizing Relational Databases from Unstructured Text

来源:Arxiv_logoArxiv
英文摘要

Relational databases are central to modern data management, yet most data exists in unstructured forms like text documents. To bridge this gap, we leverage large language models (LLMs) to automatically synthesize a relational database by generating its schema and populating its tables from raw text. We introduce SQUiD, a novel neurosymbolic framework that decomposes this task into four stages, each with specialized techniques. Our experiments show that SQUiD consistently outperforms baselines across diverse datasets.

Mushtari Sadia、Zhenning Yang、Yunming Xiao、Ang Chen、Amrita Roy Chowdhury

计算技术、计算机技术

Mushtari Sadia,Zhenning Yang,Yunming Xiao,Ang Chen,Amrita Roy Chowdhury.SQUiD: Synthesizing Relational Databases from Unstructured Text[EB/OL].(2025-05-25)[2025-07-16].https://arxiv.org/abs/2505.19025.点此复制

评论