Can Artificial Intelligence Write Like Borges? An Evaluation Protocol for Spanish Microfiction
Can Artificial Intelligence Write Like Borges? An Evaluation Protocol for Spanish Microfiction
Automated story writing has been a subject of study for over 60 years. Large language models can generate narratively consistent and linguistically coherent short fiction texts. Despite these advancements, rigorous assessment of such outputs for literary merit - especially concerning aesthetic qualities - has received scant attention. In this paper, we address the challenge of evaluating AI-generated microfictions and argue that this task requires consideration of literary criteria across various aspects of the text, such as thematic coherence, textual clarity, interpretive depth, and aesthetic quality. To facilitate this, we present GrAImes: an evaluation protocol grounded in literary theory, specifically drawing from a literary perspective, to offer an objective framework for assessing AI-generated microfiction. Furthermore, we report the results of our validation of the evaluation protocol, as answered by both literature experts and literary enthusiasts. This protocol will serve as a foundation for evaluating automatically generated microfictions and assessing their literary value.
Gerardo Aleman Manzanarez、Nora de la Cruz Arana、Jorge Garcia Flores、Yobany Garcia Medina、Raul Monroy、Nathalie Pernelle
计算技术、计算机技术语言学
Gerardo Aleman Manzanarez,Nora de la Cruz Arana,Jorge Garcia Flores,Yobany Garcia Medina,Raul Monroy,Nathalie Pernelle.Can Artificial Intelligence Write Like Borges? An Evaluation Protocol for Spanish Microfiction[EB/OL].(2025-06-09)[2025-06-21].https://arxiv.org/abs/2506.08172.点此复制
评论