|国家预印本平台
首页|Unveiling Challenges for LLMs in Enterprise Data Engineering

Unveiling Challenges for LLMs in Enterprise Data Engineering

Unveiling Challenges for LLMs in Enterprise Data Engineering

来源:Arxiv_logoArxiv
英文摘要

Large Language Models (LLMs) have demonstrated significant potential for automating data engineering tasks on tabular data, giving enterprises a valuable opportunity to reduce the high costs associated with manual data handling. However, the enterprise domain introduces unique challenges that existing LLM-based approaches for data engineering often overlook, such as large table sizes, more complex tasks, and the need for internal knowledge. To bridge these gaps, we identify key enterprise-specific challenges related to data, tasks, and background knowledge and conduct a comprehensive study of their impact on recent LLMs for data engineering. Our analysis reveals that LLMs face substantial limitations in real-world enterprise scenarios, resulting in significant accuracy drops. Our findings contribute to a systematic understanding of LLMs for enterprise data engineering to support their adoption in industry.

Jan-Micha Bodensohn、Ulf Brackmann、Liane Vogel、Anupam Sanghi、Carsten Binnig

计算技术、计算机技术

Jan-Micha Bodensohn,Ulf Brackmann,Liane Vogel,Anupam Sanghi,Carsten Binnig.Unveiling Challenges for LLMs in Enterprise Data Engineering[EB/OL].(2025-04-15)[2025-06-05].https://arxiv.org/abs/2504.10950.点此复制

评论