首页|Speech-to-Speech Translation Pipelines for Conversations in Low-Resource Languages

Speech-to-Speech Translation Pipelines for Conversations in Low-Resource Languages

来源：

英文摘要

The popularity of automatic speech-to-speech translation for human conversations is growing, but the quality varies significantly depending on the language pair. In a context of community interpreting for low-resource languages, namely Turkish and Pashto to/from French, we collected fine-tuning and testing data, and compared systems using several automatic metrics (BLEU, COMET, and BLASER) and human assessments. The pipelines included automatic speech recognition, machine translation, and speech synthesis, with local models and cloud-based commercial ones. Some components have been fine-tuned on our data. We evaluated over 60 pipelines and determined the best one for each direction. We also found that the ranks of components are generally independent of the rest of the pipeline.

作者：Andrei Popescu-Belis、Alexis Allemann、Teo Ferrari、Gopal Krishnamani

作者单位：

学科分类：计算技术、计算机技术

推荐引用：Andrei Popescu-Belis,Alexis Allemann,Teo Ferrari,Gopal Krishnamani.Speech-to-Speech Translation Pipelines for Conversations in Low-Resource Languages[EB/OL].(2025-06-02)[2025-06-18].https://arxiv.org/abs/2506.01406.点此复制

Speech-to-Speech Translation Pipelines for Conversations in Low-Resource Languages

Speech-to-Speech Translation Pipelines for Conversations in Low-Resource Languages

评论