
Multilingual Prompting for Improving LLM Generation Diversity


Source: arXiv
Abstract

Large Language Models (LLMs) are known to lack cultural representation and overall diversity in their generations, from expressing opinions to answering factual questions. To mitigate this problem, we propose multilingual prompting: a prompting method which generates several variations of a base prompt with added cultural and linguistic cues from several cultures, generates responses, and then combines the results. Building on evidence that LLMs have language-specific knowledge, multilingual prompting seeks to increase diversity by activating a broader range of cultural knowledge embedded in model training data. Through experiments across multiple models (GPT-4o, GPT-4o-mini, LLaMA 70B, and LLaMA 8B), we show that multilingual prompting consistently outperforms existing diversity-enhancing techniques such as high-temperature sampling, step-by-step recall, and personas prompting. Further analyses show that the benefits of multilingual prompting vary with language resource level and model size, and that aligning the prompting language with the cultural cues reduces hallucination about culturally-specific information.
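The three steps named in the abstract (build prompt variations with cultural and linguistic cues, generate responses, combine the results) could be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the `CUES` list, the bracketed prompt template, the `generate` callable, and the case-insensitive de-duplication used to combine answers are all hypothetical choices that the abstract does not specify.

```python
from typing import Callable, List, Tuple

# Hypothetical (language, culture) pairs; the abstract does not list the ones used.
CUES: List[Tuple[str, str]] = [
    ("English", "American"),
    ("Japanese", "Japanese"),
    ("Spanish", "Mexican"),
]


def build_variants(base_prompt: str, cues: List[Tuple[str, str]] = CUES) -> List[str]:
    """Step 1: create one variation of the base prompt per (language, culture) cue.

    The bracketed template is an assumption made for this sketch.
    """
    return [
        f"[Language: {lang}] [Cultural context: {culture}] {base_prompt}"
        for lang, culture in cues
    ]


def multilingual_prompting(
    base_prompt: str,
    generate: Callable[[str], List[str]],
    cues: List[Tuple[str, str]] = CUES,
) -> List[str]:
    """Steps 2 and 3: generate responses per variant, then combine them.

    `generate` stands in for any LLM call that maps a prompt to a list of answers.
    Combining is done here by an order-preserving, case-insensitive union,
    which is an assumption of this sketch.
    """
    combined: List[str] = []
    seen = set()
    for variant in build_variants(base_prompt, cues):
        for answer in generate(variant):
            cleaned = answer.strip()
            key = cleaned.lower()
            if key and key not in seen:
                seen.add(key)
                combined.append(cleaned)
    return combined
```

Passing the model call in as `generate` keeps the sketch independent of any particular LLM client, so the same combination logic can be tried with a stub function before a real model is wired in.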

Qihan Wang, Shidong Pan, Tal Linzen, Emily Black

Subjects: Linguistics; Cultural Theory

Qihan Wang, Shidong Pan, Tal Linzen, Emily Black. Multilingual Prompting for Improving LLM Generation Diversity [EB/OL]. (2025-05-21) [2025-06-04]. https://arxiv.org/abs/2505.15229.
