|国家预印本平台
首页|Pivot Language for Low-Resource Machine Translation

Pivot Language for Low-Resource Machine Translation

Pivot Language for Low-Resource Machine Translation

来源:Arxiv_logoArxiv
英文摘要

Certain pairs of languages suffer from lack of a parallel corpus which is large in size and diverse in domain. One of the ways this is overcome is via use of a pivot language. In this paper we use Hindi as a pivot language to translate Nepali into English. We describe what makes Hindi a good candidate for the pivot. We discuss ways in which a pivot language can be used, and use two such approaches - the Transfer Method (fully supervised) and Backtranslation (semi-supervised) - to translate Nepali into English. Using the former, we are able to achieve a devtest Set SacreBLEU score of 14.2, which improves the baseline fully supervised score reported by (Guzman et al., 2019) by 6.6 points. While we are slightly below the semi-supervised baseline score of 15.1, we discuss what may have caused this under-performance, and suggest scope for future work.

Abhimanyu Talwar、Julien Laasri

语言学印欧语系

Abhimanyu Talwar,Julien Laasri.Pivot Language for Low-Resource Machine Translation[EB/OL].(2025-05-20)[2025-07-16].https://arxiv.org/abs/2505.14553.点此复制

评论