首页|PPT: A Process-based Preference Learning Framework for Self Improving Table Question Answering Models

PPT: A Process-based Preference Learning Framework for Self Improving Table Question Answering Models

来源：

英文摘要

Improving large language models (LLMs) with self-generated data has demonstrated success in tasks such as mathematical reasoning and code generation. Yet, no exploration has been made on table question answering (TQA), where a system answers questions based on tabular data. Addressing this gap is crucial for TQA, as effective self-improvement can boost performance without requiring costly or manually annotated data. In this work, we propose PPT, a Process-based Preference learning framework for TQA. It decomposes reasoning chains into discrete states, assigns scores to each state, and samples contrastive steps for preference learning. Experimental results show that PPT effectively improves TQA models by up to 5% on in-domain datasets and 2.4% on out-of-domain datasets, with only 8,000 preference pairs. Furthermore, the resulting models achieve competitive results compared to more complex and larger state-of-the-art TQA systems, while being five times more efficient during inference.

作者：Wei Zhou、Mohsen Mesgar、Heike Adel、Annemarie Friedrich

作者单位：

学科分类：计算技术、计算机技术

推荐引用：Wei Zhou,Mohsen Mesgar,Heike Adel,Annemarie Friedrich.PPT: A Process-based Preference Learning Framework for Self Improving Table Question Answering Models[EB/OL].(2025-05-23)[2025-06-15].https://arxiv.org/abs/2505.17565.点此复制

PPT: A Process-based Preference Learning Framework for Self Improving Table Question Answering Models

PPT: A Process-based Preference Learning Framework for Self Improving Table Question Answering Models

评论