Alternating Approach-Putt Models for Multi-Stage Speech Enhancement

Abstract

Speech enhancement using artificial neural networks aims to remove noise from noisy speech signals while preserving the speech content. However, speech enhancement networks often introduce distortions to the speech signal, referred to as artifacts, which can degrade audio quality. In this work, we propose a post-processing neural network designed to mitigate the artifacts introduced by speech enhancement models. Inspired by the analogy of making a 'Putt' after an 'Approach' in golf, we name our model PuttNet. We demonstrate that alternating between a speech enhancement model and the proposed Putt model improves speech quality, as measured by perceptual quality (PESQ), objective intelligibility (STOI), and background noise intrusiveness (CBAK) scores. Furthermore, we illustrate with graphical analysis why this alternating approach outperforms repeated application of either model alone.
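
The alternating scheme described in the abstract can be sketched as a simple loop. This is only an illustrative sketch, not the authors' implementation: the function name `alternate_approach_putt`, the `num_rounds` parameter, and the two toy stand-in models are all hypothetical, and the abstract does not state how many rounds are used. The only ordering taken from the source is that a Putt step follows an Approach step.

```python
from typing import Callable, List, Sequence

Signal = List[float]
Model = Callable[[Signal], Signal]


def alternate_approach_putt(
    noisy: Sequence[float],
    approach: Model,
    putt: Model,
    num_rounds: int = 2,  # assumption: the number of rounds is not given in the abstract
) -> List[Signal]:
    """Apply `approach` then `putt`, `num_rounds` times.

    Returns every intermediate signal, starting with the input, so that
    each stage can be scored separately (for example with PESQ or STOI).
    """
    stages: List[Signal] = [list(noisy)]
    for _ in range(num_rounds):
        stages.append(approach(stages[-1]))  # enhancement ("Approach") step
        stages.append(putt(stages[-1]))      # artifact-mitigation ("Putt") step
    return stages


# Toy stand-ins so the sketch runs end to end. These are NOT the paper's
# networks; they are arbitrary arithmetic placeholders.
def toy_approach(x: Signal) -> Signal:
    return [0.5 * v for v in x]


def toy_putt(x: Signal) -> Signal:
    return [v + 1.0 for v in x]


stages = alternate_approach_putt([4.0, 8.0], toy_approach, toy_putt, num_rounds=2)
```

In practice the two callables would wrap trained models, and each entry of `stages` could be scored against the clean reference to compare alternating stages with repeated application of a single model.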

Authors: Iksoon Jeong, Kyung-Joong Kim, Kang-Hun Ahn

Subject classification: Computing technology, computer technology

Recommended citation: Iksoon Jeong, Kyung-Joong Kim, Kang-Hun Ahn. Alternating Approach-Putt Models for Multi-Stage Speech Enhancement [EB/OL]. (2025-08-14) [2025-08-24]. https://arxiv.org/abs/2508.10436.
