|国家预印本平台
首页|Measuring Time-Series Dataset Similarity using Wasserstein Distance

Measuring Time-Series Dataset Similarity using Wasserstein Distance

Measuring Time-Series Dataset Similarity using Wasserstein Distance

来源:Arxiv_logoArxiv
英文摘要

The emergence of time-series foundation model research elevates the growing need to measure the (dis)similarity of time-series datasets. A time-series dataset similarity measure aids research in multiple ways, including model selection, finetuning, and visualization. In this paper, we propose a distribution-based method to measure time-series dataset similarity by leveraging the Wasserstein distance. We consider a time-series dataset an empirical instantiation of an underlying multivariate normal distribution (MVN). The similarity between two time-series datasets is thus computed as the Wasserstein distance between their corresponding MVNs. Comprehensive experiments and visualization show the effectiveness of our approach. Specifically, we show how the Wasserstein distance helps identify similar time-series datasets and facilitates inference performance estimation of foundation models in both out-of-distribution and transfer learning evaluation, with high correlations between our proposed measure and the inference loss (>0.60).

Hongjie Chen、Akshay Mehra、Josh Kimball、Ryan A. Rossi

计算技术、计算机技术

Hongjie Chen,Akshay Mehra,Josh Kimball,Ryan A. Rossi.Measuring Time-Series Dataset Similarity using Wasserstein Distance[EB/OL].(2025-07-29)[2025-08-06].https://arxiv.org/abs/2507.22189.点此复制

评论