|国家预印本平台
首页|Research Borderlands: Analysing Writing Across Research Cultures

Research Borderlands: Analysing Writing Across Research Cultures

Research Borderlands: Analysing Writing Across Research Cultures

来源:Arxiv_logoArxiv
英文摘要

Improving cultural competence of language technologies is important. However most recent works rarely engage with the communities they study, and instead rely on synthetic setups and imperfect proxies of culture. In this work, we take a human-centered approach to discover and measure language-based cultural norms, and cultural competence of LLMs. We focus on a single kind of culture, research cultures, and a single task, adapting writing across research cultures. Through a set of interviews with interdisciplinary researchers, who are experts at moving between cultures, we create a framework of structural, stylistic, rhetorical, and citational norms that vary across research cultures. We operationalise these features with a suite of computational metrics and use them for (a) surfacing latent cultural norms in human-written research papers at scale; and (b) highlighting the lack of cultural competence of LLMs, and their tendency to homogenise writing. Overall, our work illustrates the efficacy of a human-centered approach to measuring cultural norms in human-written and LLM-generated texts.

Shaily Bhatt、Tal August、Maria Antoniak

语言学科学、科学研究

Shaily Bhatt,Tal August,Maria Antoniak.Research Borderlands: Analysing Writing Across Research Cultures[EB/OL].(2025-05-31)[2025-06-30].https://arxiv.org/abs/2506.00784.点此复制

评论