|国家预印本平台
首页|What Prompts Don't Say: Understanding and Managing Underspecification in LLM Prompts

What Prompts Don't Say: Understanding and Managing Underspecification in LLM Prompts

What Prompts Don't Say: Understanding and Managing Underspecification in LLM Prompts

来源:Arxiv_logoArxiv
英文摘要

Building LLM-powered software requires developers to communicate their requirements through natural language, but developer prompts are frequently underspecified, failing to fully capture many user-important requirements. In this paper, we present an in-depth analysis of prompt underspecification, showing that while LLMs can often (41.1%) guess unspecified requirements by default, such behavior is less robust: Underspecified prompts are 2x more likely to regress over model or prompt changes, sometimes with accuracy drops by more than 20%. We then demonstrate that simply adding more requirements to a prompt does not reliably improve performance, due to LLMs' limited instruction-following capabilities and competing constraints, and standard prompt optimizers do not offer much help. To address this, we introduce novel requirements-aware prompt optimization mechanisms that can improve performance by 4.8% on average over baselines that naively specify everything in the prompt. Beyond prompt optimization, we envision that effectively managing prompt underspecification requires a broader process, including proactive requirements discovery, evaluation, and monitoring.

Qianou Ma、Chenyang Yang、Yike Shi、Michael Xieyang Liu、Christian K?stner、Tongshuang Wu

计算技术、计算机技术

Qianou Ma,Chenyang Yang,Yike Shi,Michael Xieyang Liu,Christian K?stner,Tongshuang Wu.What Prompts Don't Say: Understanding and Managing Underspecification in LLM Prompts[EB/OL].(2025-05-19)[2025-07-25].https://arxiv.org/abs/2505.13360.点此复制

评论