首页|HiFi-Stream: Streaming Speech Enhancement with Generative Adversarial Networks

HiFi-Stream: Streaming Speech Enhancement with Generative Adversarial Networks

来源：

英文摘要

Speech Enhancement techniques have become core technologies in mobile devices and voice software simplifying downstream speech tasks. Still, modern Deep Learning (DL) solutions often require high amount of computational resources what makes their usage on low-resource devices challenging. We present HiFi-Stream, an optimized version of recently published HiFi++ model. Our experiments demonstrate that HiFiStream saves most of the qualities of the original model despite its size and computational complexity: the lightest version has only around 490k parameters which is 3.5x reduction in comparison to the original HiFi++ making it one of the smallest and fastest models available. The model is evaluated in streaming setting where it demonstrates its superior performance in comparison to modern baselines.

作者：Ekaterina Dmitrieva、Maksim Kaledin

作者单位：

学科分类：计算技术、计算机技术

推荐引用：Ekaterina Dmitrieva,Maksim Kaledin.HiFi-Stream: Streaming Speech Enhancement with Generative Adversarial Networks[EB/OL].(2025-03-21)[2025-06-08].https://arxiv.org/abs/2503.17141.点此复制

HiFi-Stream: Streaming Speech Enhancement with Generative Adversarial Networks

HiFi-Stream: Streaming Speech Enhancement with Generative Adversarial Networks

评论