首页|Artificial Intelligence health advice accuracy varies across languages and contexts

Artificial Intelligence health advice accuracy varies across languages and contexts

来源：

英文摘要

Using basic health statements authorized by UK and EU registers and 9,100 journalist-vetted public-health assertions on topics such as abortion, COVID-19 and politics from sources ranging from peer-reviewed journals and government advisories to social media and news across the political spectrum, we benchmark six leading large language models from in 21 languages, finding that, despite high accuracy on English-centric textbook claims, performance falls in multiple non-European languages and fluctuates by topic and source, highlighting the urgency of comprehensive multilingual, domain-aware validation before deploying AI in global health communication.

作者：Prashant Garg、Thiemo Fetzer

作者单位：

学科分类：医学现状、医学发展语言学

推荐引用：Prashant Garg,Thiemo Fetzer.Artificial Intelligence health advice accuracy varies across languages and contexts[EB/OL].(2025-04-25)[2025-06-19].https://arxiv.org/abs/2504.18310.点此复制

Artificial Intelligence health advice accuracy varies across languages and contexts

Artificial Intelligence health advice accuracy varies across languages and contexts

评论