面向医药领域的中文语义解析
hinese Semantic Parsing for Medicine Field
语义解析是指将自然语言句子转化成便于机器理解和推理的意义形式。英文语义解析的研究已取得较大进展,中文语义解析的研究工作却寥寥无几。医药领域有丰富的数据资源,由于这些资源都是以文本信息的形式存在,很难被计算机处理。因此,本文针对中文医药领域的特点,提出一种面向医药领域的中文语义解析方法(Chinese Semantic Parsing for Medicine Field, CSPMF),将中文句子转化成其相应的意义表示看作是一个机器翻译的过程。首先构造用于中文语义解析的医药数据集,数据集中每条训练数据包括一个中文句子及其正确的意义表示。然后利用词对齐模型来获取由中文自然语言字符串及其相应的意义表示所组成的双语词典。最后通过学习一个概率估计模型来确定最终的语义解析模型。实验表明,CSPMF有较高的精确度和召回率。
Semantic parsing is the task of transforming natural-language sentences into complete, formal, symbolic meaning representations (MR) suitable for reasoning or machine-understanding. In recent years, the research of semantic parsing in English has made great progress. However, little work has been done in Chinese semantic parsing. The medical field has a rich source of data, which is delivered by text. It is difficult for computer to process and understand the medical data.In this paper, we propose a statistical approach called CSPMF(Chinese Semantic Parsing for Medicine Field)aiming at Chinese semantic parsing for medicine text, which consider the process of converting Chinese sentence into its corresponding meaning as a machine translation procedure. At first, we create a new dataset of medicine for Chinese Semantic Parsing, in which each data contains a Chinese sentence and its accurate meaning. Then we use the word alignment model to acquire the bilingual dictionary made up by the Chinese natural language string and its meaning. In the end, we determine the ultimate semantic analysis by learning a statistical model. Experiments show that CSPMF performs well with higher precision and recall.
高志强、刘倩、吕永涛
医药卫生理论语言学汉语
语义解析自然语言处理医药文本
semantic parsingnatural language processingmedicine text
高志强,刘倩,吕永涛.面向医药领域的中文语义解析[EB/OL].(2017-03-03)[2025-08-04].http://www.paper.edu.cn/releasepaper/content/201703-60.点此复制
评论