Identifying Contextual Information in Document Classification using Term Weighting

P. R. Deshmukh, R. Phalnikar
{"title":"Identifying Contextual Information in Document Classification using Term Weighting","authors":"P. R. Deshmukh, R. Phalnikar","doi":"10.1109/IADCC.2018.8692141","DOIUrl":null,"url":null,"abstract":"Document classification particularly in biomedical research plays a vital role in extracting knowledge from medical literature, journal, article and report. To extract meaningful information such as signs, symptoms, diagnoses and treatments of any disease by classification, the context needs to be considered. The need to automatically extract key information from medical text has been widely accepted and it has been proven that search based approaches are limited in their ability. This paper presents a novel method of information identification for a particular disease using Gaussian Naïve Bayes and feature weighting approach that is then classified by the context. It is useful to enhance the effectiveness of analytics by considering the importance of the term as well as the probability of every feature of the disease during classification. Experimental results show that our method upgrades performance of classification system and is an improvement from traditional classification system.","PeriodicalId":365713,"journal":{"name":"2018 IEEE 8th International Advance Computing Conference (IACC)","volume":"61 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 8th International Advance Computing Conference (IACC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IADCC.2018.8692141","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Document classification particularly in biomedical research plays a vital role in extracting knowledge from medical literature, journal, article and report. To extract meaningful information such as signs, symptoms, diagnoses and treatments of any disease by classification, the context needs to be considered. The need to automatically extract key information from medical text has been widely accepted and it has been proven that search based approaches are limited in their ability. This paper presents a novel method of information identification for a particular disease using Gaussian Naïve Bayes and feature weighting approach that is then classified by the context. It is useful to enhance the effectiveness of analytics by considering the importance of the term as well as the probability of every feature of the disease during classification. Experimental results show that our method upgrades performance of classification system and is an improvement from traditional classification system.
使用词加权识别文档分类中的上下文信息
文献分类在从医学文献、期刊、文章和报告中提取知识方面起着至关重要的作用,特别是在生物医学研究中。为了通过分类提取任何疾病的体征、症状、诊断和治疗等有意义的信息,需要考虑上下文。从医学文本中自动提取关键信息的需求已经被广泛接受,并且已经证明基于搜索的方法在其能力上是有限的。本文提出了一种使用高斯Naïve贝叶斯和特征加权方法对特定疾病进行信息识别的新方法,然后根据上下文进行分类。通过考虑术语的重要性以及在分类过程中疾病的每个特征的概率,有助于提高分析的有效性。实验结果表明,该方法提高了分类系统的性能,是对传统分类系统的改进。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信