PHRASED: Phrase Dictionary Biasing for Speech Translation
Phrases are essential to understanding the core concepts in conversations. However, because phrases occur rarely in training data, translating them correctly is challenging in speech translation tasks. In this paper, we propose a phrase dictionary biasing method that leverages pairs of phrases mapped from the source language to the target language. We apply the method to two widely adopted types of models: a transducer-based streaming speech translation model and a multimodal large language model. Experimental results show that phrase dictionary biasing outperforms phrase list biasing by a relative 21% for the streaming speech translation model. In addition, phrase dictionary biasing enables multimodal large language models to use external phrase information, achieving an 85% relative improvement in phrase recall.
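The abstract reports its gains as relative improvements, including one in phrase recall. As a rough illustration of how such numbers are typically computed, the sketch below assumes that phrase recall is the fraction of dictionary target phrases present in the reference translation that also appear in the system output; the abstract does not give the exact definition, so this is an assumption. The function names, the toy dictionary, and the sentences are all hypothetical and are not taken from the paper.

```python
def phrase_recall(references, hypotheses, phrase_dict):
    """Fraction of expected target phrases that appear in the hypotheses.

    Assumed definition: a target phrase is "expected" for an utterance if it
    occurs in the reference translation, and "recalled" if it also occurs in
    the system hypothesis. Matching is case-insensitive substring matching.
    """
    expected = 0
    recalled = 0
    for ref, hyp in zip(references, hypotheses):
        ref_l, hyp_l = ref.lower(), hyp.lower()
        for target in phrase_dict.values():
            target_l = target.lower()
            if target_l in ref_l:
                expected += 1
                if target_l in hyp_l:
                    recalled += 1
    return recalled / expected if expected else 0.0


def relative_improvement(baseline, improved):
    """Relative gain of `improved` over `baseline`, e.g. 0.25 means 25%."""
    return (improved - baseline) / baseline


# Hypothetical source -> target phrase pairs (English -> German).
phrase_dict = {
    "speech translation": "Sprachübersetzung",
    "phrase dictionary": "Phrasenwörterbuch",
}
references = ["Die Sprachübersetzung nutzt ein Phrasenwörterbuch."]
baseline_hyps = ["Die Sprachübersetzung nutzt ein Wörterbuch."]
biased_hyps = ["Die Sprachübersetzung nutzt ein Phrasenwörterbuch."]

baseline_recall = phrase_recall(references, baseline_hyps, phrase_dict)
biased_recall = phrase_recall(references, biased_hyps, phrase_dict)
gain = relative_improvement(baseline_recall, biased_recall)
```

In this toy case the baseline output recovers one of the two expected phrases and the biased output recovers both, so the relative improvement is measured against the baseline recall rather than as an absolute difference.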
Peidong Wang, Jian Xue, Rui Zhao, Junkun Chen, Aswin Shanmugam Subramanian, Jinyu Li
Linguistics; Computing Technology, Computer Technology
Peidong Wang, Jian Xue, Rui Zhao, Junkun Chen, Aswin Shanmugam Subramanian, Jinyu Li. PHRASED: Phrase Dictionary Biasing for Speech Translation [EB/OL]. (2025-06-10) [2025-07-16]. https://arxiv.org/abs/2506.09175.