Combining Discrete Wavelet and Cosine Transforms for Efficient Sentence Embedding
Wavelets have emerged as a cutting-edge technology in a number of fields. Concrete results of their application in image and signal processing suggest that wavelets can be effectively applied to Natural Language Processing (NLP) tasks that capture a variety of linguistic properties. In this paper, we apply Discrete Wavelet Transforms (DWT) to word and sentence embeddings. We first evaluate, intrinsically and extrinsically, how effectively wavelets can consolidate the important information in a word vector while reducing its dimensionality. We then combine DWT with the Discrete Cosine Transform (DCT) to propose a non-parameterized model that compresses a sentence into a fixed-size, information-dense vector based on locally varying word features. We show the efficacy of the proposed paradigm on downstream application models, where it yields results comparable to, and in some tasks superior to, those of the original embeddings.
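To make the pipeline described in the abstract more concrete, the following is a minimal sketch of one way a DWT step and a DCT step could be chained. It is not the authors' implementation. The Haar wavelet, the choice to keep only the approximation coefficients, the number of DCT coefficients `k`, and all function names (`haar_dwt`, `dct2`, `sentence_embedding`) are assumptions made for illustration; the abstract does not specify any of them.

```python
import math

def haar_dwt(vec):
    """One level of the Haar DWT: returns (approximation, detail) coefficients."""
    vec = list(vec)
    if len(vec) % 2:
        vec.append(vec[-1])  # assumption: pad odd-length input by repeating the last value
    s = math.sqrt(2.0)
    approx = [(vec[i] + vec[i + 1]) / s for i in range(0, len(vec), 2)]
    detail = [(vec[i] - vec[i + 1]) / s for i in range(0, len(vec), 2)]
    return approx, detail

def dct2(signal, k):
    """First k orthonormal DCT-II coefficients of a 1-D signal.

    Coefficients beyond the signal length are set to zero so the output
    always has exactly k entries.
    """
    n = len(signal)
    out = []
    for j in range(k):
        if j >= n:
            out.append(0.0)
            continue
        scale = math.sqrt(1.0 / n) if j == 0 else math.sqrt(2.0 / n)
        total = sum(x * math.cos(math.pi * (i + 0.5) * j / n)
                    for i, x in enumerate(signal))
        out.append(scale * total)
    return out

def sentence_embedding(word_vectors, k=2, levels=1):
    """Hypothetical DWT-then-DCT sentence encoder (illustrative only).

    1. Shrink each word vector with `levels` rounds of Haar DWT,
       keeping only the approximation coefficients.
    2. For each remaining feature dimension, run a DCT along the word
       axis and keep the first k coefficients.
    The result has k * (reduced dimension) entries, whatever the
    sentence length.
    """
    reduced = []
    for vec in word_vectors:
        approx = list(vec)
        for _ in range(levels):
            approx, _detail = haar_dwt(approx)
        reduced.append(approx)

    dims = len(reduced[0])
    embedding = []
    for d in range(dims):
        column = [word[d] for word in reduced]  # one feature across the sentence
        embedding.extend(dct2(column, k))
    return embedding

# Toy usage: sentences of different lengths map to vectors of the same size.
short = [[1.0, 1.0, 2.0, 2.0], [0.0, 1.0, 0.0, 1.0]]
longer = short + [[3.0, 1.0, 4.0, 1.0], [5.0, 9.0, 2.0, 6.0]]
emb_short = sentence_embedding(short, k=2)
emb_long = sentence_embedding(longer, k=2)
```

In this sketch each DWT level halves the word-vector dimension, and the low-order DCT coefficients summarize how each feature varies across the sentence, so the output size depends only on `k` and the reduced dimension, not on the number of words.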
Rana Salama, Abdou Youssef, Mona Diab
Linguistics
Rana Salama, Abdou Youssef, Mona Diab. Combining Discrete Wavelet and Cosine Transforms for Efficient Sentence Embedding [EB/OL]. (2025-08-01) [2025-08-11]. https://arxiv.org/abs/2508.00420.