|国家预印本平台
首页|Lightweight Prompt Biasing for Contextualized End-to-End ASR Systems

Lightweight Prompt Biasing for Contextualized End-to-End ASR Systems

Lightweight Prompt Biasing for Contextualized End-to-End ASR Systems

来源:Arxiv_logoArxiv
英文摘要

End-to-End Automatic Speech Recognition (ASR) has advanced significantly yet still struggles with rare and domain-specific entities. This paper introduces a simple yet efficient prompt-based biasing technique for contextualized ASR, enhancing recognition accuracy by leverage a unified multitask learning framework. The approach comprises two key components: a prompt biasing model which is trained to determine when to focus on entities in prompt, and a entity filtering mechanism which efficiently filters out irrelevant entities. Our method significantly enhances ASR accuracy on entities, achieving a relative 30.7% and 18.0% reduction in Entity Word Error Rate compared to the baseline model with shallow fusion on in-house domain dataset with small and large entity lists, respectively. The primary advantage of this method lies in its efficiency and simplicity without any structure change, making it lightweight and highly efficient.

Bo Ren、Yu Shi、Jinyu Li

计算技术、计算机技术

Bo Ren,Yu Shi,Jinyu Li.Lightweight Prompt Biasing for Contextualized End-to-End ASR Systems[EB/OL].(2025-06-06)[2025-07-25].https://arxiv.org/abs/2506.06252.点此复制

评论