CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark
Artificial Intelligence (AI), along with recent progress in biomedical language understanding, is gradually changing medical practice. With the development of biomedical language understanding benchmarks, AI applications have become widely used in the medical field. However, most benchmarks are limited to English, which makes it hard to replicate in other languages many of the successes achieved in English. To facilitate research in this direction, we collect real-world biomedical data and present the first Chinese Biomedical Language Understanding Evaluation (CBLUE) benchmark: a collection of natural language understanding tasks, including named entity recognition, information extraction, clinical diagnosis normalization, and single-sentence/sentence-pair classification, together with an associated online platform for model evaluation, comparison, and analysis. To establish evaluation on these tasks, we report empirical results with 11 current pre-trained Chinese models; the experiments show that state-of-the-art neural models perform far worse than the human ceiling. Our benchmark is released at https://tianchi.aliyun.com/dataset/dataDetail?dataId=95414&lang=en-us.
Kangping Yin, Xin Shang, Ningyu Zhang, Qingcai Chen, Mosha Chen, Xiaozhuan Liang, Fei Huang, Chuanqi Tan, Zhifang Sui, Linfeng Li, Hui Zong, Baobao Chang, Zhen Bi, Lei Li, Buzhou Tang, Zheng Yuan, Guotong Xie, Jun Yan, Jian Xu, Yuan Ni, Hongying Zan, Kunli Zhang, Luo Si
Medical and health theory; biological science theory and methods; linguistics
Kangping Yin, Xin Shang, Ningyu Zhang, Qingcai Chen, Mosha Chen, Xiaozhuan Liang, Fei Huang, Chuanqi Tan, Zhifang Sui, Linfeng Li, Hui Zong, Baobao Chang, Zhen Bi, Lei Li, Buzhou Tang, Zheng Yuan, Guotong Xie, Jun Yan, Jian Xu, Yuan Ni, Hongying Zan, Kunli Zhang, Luo Si. CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark[EB/OL]. (2021-06-15)[2025-08-02]. https://arxiv.org/abs/2106.08087.