首页|SAIBench: Benchmarking AI for Science

SAIBench: Benchmarking AI for Science

来源：

英文摘要

Scientific research communities are embracing AI-based solutions to target tractable scientific tasks and improve research workflows. However, the development and evaluation of such solutions are scattered across multiple disciplines. We formalize the problem of scientific AI benchmarking, and propose a system called SAIBench in the hope of unifying the efforts and enabling low-friction on-boarding of new disciplines. The system approaches this goal with SAIL, a domain-specific language to decouple research problems, AI models, ranking criteria, and software/hardware configuration into reusable modules. We show that this approach is flexible and can adapt to problems, AI models, and evaluation methods defined in different perspectives. The project homepage is https://www.computercouncil.org/SAIBench

作者：Jianfeng Zhan、Yatao Li

作者单位：

DOI：10.1016/j.tbench.2022.100063

学科分类：信息科学、信息技术自然科学研究方法计算技术、计算机技术

推荐引用：Jianfeng Zhan,Yatao Li.SAIBench: Benchmarking AI for Science[EB/OL].(2022-06-11)[2025-07-25].https://arxiv.org/abs/2206.05418.点此复制

SAIBench: Benchmarking AI for Science

SAIBench: Benchmarking AI for Science

评论