|国家预印本平台
首页|Evaluating K-Fold Cross Validation for Transformer Based Symbolic Regression Models

Evaluating K-Fold Cross Validation for Transformer Based Symbolic Regression Models

Evaluating K-Fold Cross Validation for Transformer Based Symbolic Regression Models

来源:Arxiv_logoArxiv
英文摘要

Symbolic Regression remains an NP-Hard problem, with extensive research focusing on AI models for this task. Transformer models have shown promise in Symbolic Regression, but performance suffers with smaller datasets. We propose applying k-fold cross-validation to a transformer-based symbolic regression model trained on a significantly reduced dataset (15,000 data points, down from 500,000). This technique partitions the training data into multiple subsets (folds), iteratively training on some while validating on others. Our aim is to provide an estimate of model generalization and mitigate overfitting issues associated with smaller datasets. Results show that this process improves the model's output consistency and generalization by a relative improvement in validation loss of 53.31%. Potentially enabling more efficient and accessible symbolic regression in resource-constrained environments.

Kaustubh Kislay、Shlok Singh、Soham Joshi、Rohan Dutta、Jay Shim、George Flint、Kevin Zhu

数学计算技术、计算机技术

Kaustubh Kislay,Shlok Singh,Soham Joshi,Rohan Dutta,Jay Shim,George Flint,Kevin Zhu.Evaluating K-Fold Cross Validation for Transformer Based Symbolic Regression Models[EB/OL].(2025-06-30)[2025-08-02].https://arxiv.org/abs/2410.21896.点此复制

评论