首页|Kajal: Extracting Grammar of a Source Code Using Large Language Models

Kajal: Extracting Grammar of a Source Code Using Large Language Models

来源：

英文摘要

Understanding and extracting the grammar of a domain-specific language (DSL) is crucial for various software engineering tasks; however, manually creating these grammars is time-intensive and error-prone. This paper presents Kajal, a novel approach that automatically infers grammar from DSL code snippets by leveraging Large Language Models (LLMs) through prompt engineering and few-shot learning. Kajal dynamically constructs input prompts, using contextual information to guide the LLM in generating the corresponding grammars, which are iteratively refined through a feedback-driven approach. Our experiments show that Kajal achieves 60% accuracy with few-shot learning and 45% without it, demonstrating the significant impact of few-shot learning on the tool's effectiveness. This approach offers a promising solution for automating DSL grammar extraction, and future work will explore using smaller, open-source LLMs and testing on larger datasets to further validate Kajal's performance.

作者：Mohammad Jalili Torkamani

作者单位：

学科分类：计算技术、计算机技术

推荐引用：Mohammad Jalili Torkamani.Kajal: Extracting Grammar of a Source Code Using Large Language Models[EB/OL].(2024-12-11)[2025-08-14].https://arxiv.org/abs/2412.08842.点此复制

Kajal: Extracting Grammar of a Source Code Using Large Language Models

Kajal: Extracting Grammar of a Source Code Using Large Language Models

评论