{"title":"基于句法依赖关系特征的特殊临床问题分类新方法","authors":"Mourad Sarrouti, Abdelmonaime Lachkar","doi":"10.1504/IJBRA.2017.10003490","DOIUrl":null,"url":null,"abstract":"Clinical question classification is an important and a challenging task for any clinical Question Answering (QA) system. It classifies questions into different semantic categories, which indicate the expected semantic type of answers. Indeed, the semantic category allows filtering out irrelevant answer candidates. Existing methods dealing with the problem of clinical question classification don't take into account the syntactic dependency relations in questions. Therefore, this may impact negatively the performance of the clinical question classification system. To overcome this drawback, we propose to incorporate the syntactic dependency relations as discriminative features for machine learning. To evaluate and illustrate the interest of our contribution, we conduct a comparative study using nine methods and two machine-learning algorithms: Naive Bayes and Support Vector Machine (SVM). The obtained results using 4654 clinical questions maintained by the National Library of Medicine (NLM) show that our proposed method is very efficient and outperforms greatly the others by the average F-score of 4.5% for Naive Bayes and 4.73% for SVM.","PeriodicalId":434900,"journal":{"name":"Int. J. Bioinform. Res. Appl.","volume":"187 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"A new and efficient method based on syntactic dependency relations features for ad hoc clinical question classification\",\"authors\":\"Mourad Sarrouti, Abdelmonaime Lachkar\",\"doi\":\"10.1504/IJBRA.2017.10003490\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Clinical question classification is an important and a challenging task for any clinical Question Answering (QA) system. It classifies questions into different semantic categories, which indicate the expected semantic type of answers. Indeed, the semantic category allows filtering out irrelevant answer candidates. Existing methods dealing with the problem of clinical question classification don't take into account the syntactic dependency relations in questions. Therefore, this may impact negatively the performance of the clinical question classification system. To overcome this drawback, we propose to incorporate the syntactic dependency relations as discriminative features for machine learning. To evaluate and illustrate the interest of our contribution, we conduct a comparative study using nine methods and two machine-learning algorithms: Naive Bayes and Support Vector Machine (SVM). The obtained results using 4654 clinical questions maintained by the National Library of Medicine (NLM) show that our proposed method is very efficient and outperforms greatly the others by the average F-score of 4.5% for Naive Bayes and 4.73% for SVM.\",\"PeriodicalId\":434900,\"journal\":{\"name\":\"Int. J. Bioinform. Res. Appl.\",\"volume\":\"187 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-03-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Int. J. Bioinform. Res. Appl.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1504/IJBRA.2017.10003490\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. Bioinform. Res. Appl.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1504/IJBRA.2017.10003490","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
摘要
临床问题分类是任何临床问答系统的一项重要且具有挑战性的任务。它将问题分为不同的语义类别,这些类别表明了期望的答案的语义类型。事实上,语义分类允许过滤掉不相关的候选答案。现有的临床问题分类方法没有考虑问题的句法依赖关系。因此,这可能会对临床问题分类系统的性能产生负面影响。为了克服这个缺点,我们建议将句法依赖关系作为机器学习的判别特征。为了评估和说明我们的贡献的兴趣,我们使用九种方法和两种机器学习算法进行了比较研究:朴素贝叶斯和支持向量机(SVM)。使用美国国家医学图书馆(National Library of Medicine, NLM)保存的4654个临床问题的结果表明,我们提出的方法非常有效,并且大大优于其他方法,朴素贝叶斯的平均f值为4.5%,支持向量机的平均f值为4.73%。
A new and efficient method based on syntactic dependency relations features for ad hoc clinical question classification
Clinical question classification is an important and a challenging task for any clinical Question Answering (QA) system. It classifies questions into different semantic categories, which indicate the expected semantic type of answers. Indeed, the semantic category allows filtering out irrelevant answer candidates. Existing methods dealing with the problem of clinical question classification don't take into account the syntactic dependency relations in questions. Therefore, this may impact negatively the performance of the clinical question classification system. To overcome this drawback, we propose to incorporate the syntactic dependency relations as discriminative features for machine learning. To evaluate and illustrate the interest of our contribution, we conduct a comparative study using nine methods and two machine-learning algorithms: Naive Bayes and Support Vector Machine (SVM). The obtained results using 4654 clinical questions maintained by the National Library of Medicine (NLM) show that our proposed method is very efficient and outperforms greatly the others by the average F-score of 4.5% for Naive Bayes and 4.73% for SVM.