
HiFi-Stream: Streaming Speech Enhancement with Generative Adversarial Networks

Source: arXiv
Abstract

Speech enhancement techniques have become core technologies in mobile devices and voice software, simplifying downstream speech tasks. Still, modern deep learning (DL) solutions often require a large amount of computational resources, which makes their use on low-resource devices challenging. We present HiFi-Stream, an optimized version of the recently published HiFi++ model. Our experiments demonstrate that HiFi-Stream retains most of the quality of the original model despite its reduced size and computational complexity: the lightest version has only around 490k parameters, a 3.5x reduction compared with the original HiFi++, making it one of the smallest and fastest models available. The model is evaluated in a streaming setting, where it demonstrates superior performance compared with modern baselines.

Ekaterina Dmitrieva, Maksim Kaledin

Computing technology, computer technology

Ekaterina Dmitrieva, Maksim Kaledin. HiFi-Stream: Streaming Speech Enhancement with Generative Adversarial Networks [EB/OL]. (2025-03-21) [2025-06-08]. https://arxiv.org/abs/2503.17141.
