Cyclic Multichannel Wiener Filter for Acoustic Beamforming
Cyclic Multichannel Wiener Filter for Acoustic Beamforming
Acoustic beamforming models typically assume wide-sense stationarity of speech signals within short time frames. However, voiced speech is better modeled as a cyclostationary (CS) process, a random process whose mean and autocorrelation are $T_1$-periodic, where $α_1=1/T_1$ corresponds to the fundamental frequency of vowels. Higher harmonic frequencies are found at integer multiples of the fundamental. This work introduces a cyclic multichannel Wiener filter (cMWF) for speech enhancement derived from a cyclostationary model. This beamformer exploits spectral correlation across the harmonic frequencies of the signal to further reduce the mean-squared error (MSE) between the target and the processed input. The proposed cMWF is optimal in the MSE sense and reduces to the MWF when the target is wide-sense stationary. Experiments on simulated data demonstrate considerable improvements in scale-invariant signal-to-distortion ratio (SI-SDR) on synthetic data but also indicate high sensitivity to the accuracy of the estimated fundamental frequency $α_1$, which limits effectiveness on real data.
Giovanni Bologni、Richard Heusdens、Richard C. Hendriks
通信
Giovanni Bologni,Richard Heusdens,Richard C. Hendriks.Cyclic Multichannel Wiener Filter for Acoustic Beamforming[EB/OL].(2025-07-14)[2025-07-25].https://arxiv.org/abs/2507.10159.点此复制
评论