Tabular Feature Discovery With Reasoning Type Exploration

Source: arXiv
Abstract

Feature engineering for tabular data remains a critical yet challenging step in machine learning. Recently, large language models (LLMs) have been used to automatically generate new features by leveraging their vast knowledge. However, existing LLM-based approaches often produce overly simple or repetitive features, partly due to inherent biases in the transformations the LLM chooses and the lack of structured reasoning guidance during generation. In this paper, we propose REFeat, a novel method that guides an LLM to discover diverse and informative features by using multiple types of reasoning to steer the feature generation process. Experiments on 59 benchmark datasets demonstrate that our approach not only achieves higher predictive accuracy on average, but also discovers more diverse and meaningful features. These results highlight the promise of incorporating rich reasoning paradigms and adaptive strategy selection into LLM-driven feature discovery for tabular data.

Sungwon Han, Sungkyu Park, Seungeon Lee

Computing technology, computer technology

Sungwon Han, Sungkyu Park, Seungeon Lee. Tabular Feature Discovery With Reasoning Type Exploration [EB/OL]. (2025-06-25) [2025-07-16]. https://arxiv.org/abs/2506.20357.
