首页|Optimal Scalogram for Computational Complexity Reduction in Acoustic Recognition Using Deep Learning

Optimal Scalogram for Computational Complexity Reduction in Acoustic Recognition Using Deep Learning

来源：

英文摘要

The Continuous Wavelet Transform (CWT) is an effective tool for feature extraction in acoustic recognition using Convolutional Neural Networks (CNNs), particularly when applied to non-stationary audio. However, its high computational cost poses a significant challenge, often leading researchers to prefer alternative methods such as the Short-Time Fourier Transform (STFT). To address this issue, this paper proposes a method to reduce the computational complexity of CWT by optimizing the length of the wavelet kernel and the hop size of the output scalogram. Experimental results demonstrate that the proposed approach significantly reduces computational cost while maintaining the robust performance of the trained model in acoustic recognition tasks.

作者：Dang Thoai Phan、Tuan Anh Huynh、Van Tuan Pham、Cao Minh Tran、Van Thuan Mai、Ngoc Quy Tran

作者单位：

学科分类：计算技术、计算机技术

推荐引用：Dang Thoai Phan,Tuan Anh Huynh,Van Tuan Pham,Cao Minh Tran,Van Thuan Mai,Ngoc Quy Tran.Optimal Scalogram for Computational Complexity Reduction in Acoustic Recognition Using Deep Learning[EB/OL].(2025-05-19)[2025-07-16].https://arxiv.org/abs/2505.13017.点此复制

Optimal Scalogram for Computational Complexity Reduction in Acoustic Recognition Using Deep Learning

Optimal Scalogram for Computational Complexity Reduction in Acoustic Recognition Using Deep Learning

评论