|国家预印本平台
首页|Multilingual Sentiment Analysis of Summarized Texts: A Cross-Language Study of Text Shortening Effects

Multilingual Sentiment Analysis of Summarized Texts: A Cross-Language Study of Text Shortening Effects

Multilingual Sentiment Analysis of Summarized Texts: A Cross-Language Study of Text Shortening Effects

来源:Arxiv_logoArxiv
英文摘要

Summarization significantly impacts sentiment analysis across languages with diverse morphologies. This study examines extractive and abstractive summarization effects on sentiment classification in English, German, French, Spanish, Italian, Finnish, Hungarian, and Arabic. We assess sentiment shifts post-summarization using multilingual transformers (mBERT, XLM-RoBERTa, T5, and BART) and language-specific models (FinBERT, AraBERT). Results show extractive summarization better preserves sentiment, especially in morphologically complex languages, while abstractive summarization improves readability but introduces sentiment distortion, affecting sentiment accuracy. Languages with rich inflectional morphology, such as Finnish, Hungarian, and Arabic, experience greater accuracy drops than English or German. Findings emphasize the need for language-specific adaptations in sentiment analysis and propose a hybrid summarization approach balancing readability and sentiment preservation. These insights benefit multilingual sentiment applications, including social media monitoring, market analysis, and cross-lingual opinion mining.

Mikhail Krasitskii、Grigori Sidorov、Olga Kolesnikova、Liliana Chanona Hernandez、Alexander Gelbukh

语言学常用外国语乌拉尔语系(芬兰-乌戈尔语系)印欧语系闪-含语系(阿非罗-亚细亚语系)

Mikhail Krasitskii,Grigori Sidorov,Olga Kolesnikova,Liliana Chanona Hernandez,Alexander Gelbukh.Multilingual Sentiment Analysis of Summarized Texts: A Cross-Language Study of Text Shortening Effects[EB/OL].(2025-03-31)[2025-07-03].https://arxiv.org/abs/2504.00265.点此复制

评论