ResearchCodeBench: Benchmarking LLMs on Implementing Novel Machine Learning Research Code
Tianyu Hua Harper Hua Violet Xiang Benjamin Klieger Sang T. Truong Weixin Liang Fan-Yun Sun Nick Haber
Tianyu Hua Harper Hua Violet Xiang Benjamin Klieger Sang T. Truong Weixin Liang Fan-Yun Sun Nick Haber
评论