|国家预印本平台
首页|Evaluation Report on MCP Servers

Evaluation Report on MCP Servers

Evaluation Report on MCP Servers

来源:Arxiv_logoArxiv
英文摘要

With the rise of LLMs, a large number of Model Context Protocol (MCP) services have emerged since the end of 2024. However, the effectiveness and efficiency of MCP servers have not been well studied. To study these questions, we propose an evaluation framework, called MCPBench. We selected several widely used MCP server and conducted an experimental evaluation on their accuracy, time, and token usage. Our experiments showed that the most effective MCP, Bing Web Search, achieved an accuracy of 64%. Importantly, we found that the accuracy of MCP servers can be substantially enhanced by involving declarative interface. This research paves the way for further investigations into optimized MCP implementations, ultimately leading to better AI-driven applications and data retrieval solutions.

Zhiling Luo、Xiaorong Shi、Xuanrui Lin、Jinyang Gao

计算技术、计算机技术

Zhiling Luo,Xiaorong Shi,Xuanrui Lin,Jinyang Gao.Evaluation Report on MCP Servers[EB/OL].(2025-04-15)[2025-05-14].https://arxiv.org/abs/2504.11094.点此复制

评论