首页|Evaluation Report on MCP Servers

Evaluation Report on MCP Servers

来源：

英文摘要

With the rise of LLMs, a large number of Model Context Protocol (MCP) services have emerged since the end of 2024. However, the effectiveness and efficiency of MCP servers have not been well studied. To study these questions, we propose an evaluation framework, called MCPBench. We selected several widely used MCP server and conducted an experimental evaluation on their accuracy, time, and token usage. Our experiments showed that the most effective MCP, Bing Web Search, achieved an accuracy of 64%. Importantly, we found that the accuracy of MCP servers can be substantially enhanced by involving declarative interface. This research paves the way for further investigations into optimized MCP implementations, ultimately leading to better AI-driven applications and data retrieval solutions.

作者：Zhiling Luo、Xiaorong Shi、Xuanrui Lin、Jinyang Gao

作者单位：

学科分类：计算技术、计算机技术

推荐引用：Zhiling Luo,Xiaorong Shi,Xuanrui Lin,Jinyang Gao.Evaluation Report on MCP Servers[EB/OL].(2025-04-15)[2025-05-14].https://arxiv.org/abs/2504.11094.点此复制

Evaluation Report on MCP Servers

Evaluation Report on MCP Servers

评论