Solving Quantitative Reasoning Problems with Language Models
Solving Quantitative Reasoning Problems with Language Models
Language models have achieved remarkable performance on a wide range of tasks that require natural language understanding. Nevertheless, state-of-the-art models have generally struggled with tasks that require quantitative reasoning, such as solving mathematics, science, and engineering problems at the college level. To help close this gap, we introduce Minerva, a large language model pretrained on general natural language data and further trained on technical content. The model achieves state-of-the-art performance on technical benchmarks without the use of external tools. We also evaluate our model on over two hundred undergraduate-level problems in physics, biology, chemistry, economics, and other sciences that require quantitative reasoning, and find that the model can correctly answer nearly a third of them.
Guy Gur-Ari、Vinay Ramasesh、Behnam Neyshabur、Henryk Michalewski、Imanol Schlag、Vedant Misra、Anders Andreassen、Yuhuai Wu、Ethan Dyer、Theo Gutman-Solo、Aitor Lewkowycz、Ambrose Slone、Cem Anil、David Dohan
数学物理学化学生物科学理论、生物科学方法生物化学经济学
Guy Gur-Ari,Vinay Ramasesh,Behnam Neyshabur,Henryk Michalewski,Imanol Schlag,Vedant Misra,Anders Andreassen,Yuhuai Wu,Ethan Dyer,Theo Gutman-Solo,Aitor Lewkowycz,Ambrose Slone,Cem Anil,David Dohan.Solving Quantitative Reasoning Problems with Language Models[EB/OL].(2022-06-29)[2025-05-16].https://arxiv.org/abs/2206.14858.点此复制
评论