|国家预印本平台
首页|Gatsby Without the 'E': Crafting Lipograms with LLMs

Gatsby Without the 'E': Crafting Lipograms with LLMs

Gatsby Without the 'E': Crafting Lipograms with LLMs

来源:Arxiv_logoArxiv
英文摘要

Lipograms are a unique form of constrained writing where all occurrences of a particular letter are excluded from the text, typified by the novel Gadsby, which daringly avoids all usage of the letter 'e'. In this study, we explore the power of modern large language models (LLMs) by transforming the novel F. Scott Fitzgerald's The Great Gatsby into a fully 'e'-less text. We experimented with a range of techniques, from baseline methods like synonym replacement to sophisticated generative models enhanced with beam search and named entity analysis. We show that excluding up to 3.6% of the most common letters (up to the letter 'u') had minimal impact on the text's meaning, although translation fidelity rapidly and predictably decays with stronger lipogram constraints. Our work highlights the surprising flexibility of English under strict constraints, revealing just how adaptable and creative language can be.

Rohan Balasubramanian、Nitish Gokulakrishnan、Syeda Jannatus Saba、Steven Skiena

语言学

Rohan Balasubramanian,Nitish Gokulakrishnan,Syeda Jannatus Saba,Steven Skiena.Gatsby Without the 'E': Crafting Lipograms with LLMs[EB/OL].(2025-05-26)[2025-06-29].https://arxiv.org/abs/2505.20501.点此复制

评论