An Improved Concept Vector Space Model for Ontology Based Classification

Zenun Kastrati, Ali Shariq Imran, Sule YAYILGAN YILDIRIM
{"title":"An Improved Concept Vector Space Model for Ontology Based Classification","authors":"Zenun Kastrati, Ali Shariq Imran, Sule YAYILGAN YILDIRIM","doi":"10.1109/SITIS.2015.102","DOIUrl":null,"url":null,"abstract":"This paper proposes an improved concept vector space (ICVS) model which takes into account the importance of ontology concepts. Concept importance shows how important a concept is in an ontology. This is reflected by the number of relations a concept has to other concepts. Concept importance is computed automatically by converting the ontology into a graph initially and then employing one of the Markov based algorithms. Concept importance is then aggregated with concept relevance which is computed using the frequency of concept occurrences in the dataset. In order to demonstrate the applicability of our proposed model and to validate its efficacy, we conducted experiments on document classification using concept based vector space model. The dataset used in this paper consists of 348 documents from the funding domain. The results show that the proposed model yields higher classification accuracy comparing to traditional concept vector space (CVS) model, ultimately giving better document classification performance. We also used different classifiers in order to check for the classification accuracy. We tested CVS and ICVS on Naive Bayes and Decision Tree classifiers and the results show that the classification performance in terms of F1 measure is improved when ICVS is used on both classifiers.","PeriodicalId":128616,"journal":{"name":"2015 11th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"17","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 11th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SITIS.2015.102","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 17

Abstract

This paper proposes an improved concept vector space (ICVS) model which takes into account the importance of ontology concepts. Concept importance shows how important a concept is in an ontology. This is reflected by the number of relations a concept has to other concepts. Concept importance is computed automatically by converting the ontology into a graph initially and then employing one of the Markov based algorithms. Concept importance is then aggregated with concept relevance which is computed using the frequency of concept occurrences in the dataset. In order to demonstrate the applicability of our proposed model and to validate its efficacy, we conducted experiments on document classification using concept based vector space model. The dataset used in this paper consists of 348 documents from the funding domain. The results show that the proposed model yields higher classification accuracy comparing to traditional concept vector space (CVS) model, ultimately giving better document classification performance. We also used different classifiers in order to check for the classification accuracy. We tested CVS and ICVS on Naive Bayes and Decision Tree classifiers and the results show that the classification performance in terms of F1 measure is improved when ICVS is used on both classifiers.
基于本体分类的改进概念向量空间模型
本文提出了一种考虑本体概念重要性的改进概念向量空间(ICVS)模型。概念重要性表明了一个概念在本体论中的重要性。这反映在一个概念与其他概念之间的关系数量上。首先将本体转换为图,然后采用一种基于马尔可夫的算法自动计算概念重要性。然后将概念重要性与概念相关性聚合在一起,概念相关性使用数据集中概念出现的频率计算。为了验证模型的适用性和有效性,我们使用基于概念的向量空间模型进行了文档分类实验。本文使用的数据集由来自资助领域的348篇文献组成。结果表明,与传统的概念向量空间(CVS)模型相比,该模型具有更高的分类精度,最终具有更好的文档分类性能。为了检验分类的准确性,我们还使用了不同的分类器。我们在朴素贝叶斯和决策树分类器上对CVS和ICVS进行了测试,结果表明,当ICVS在两个分类器上同时使用时,在F1度量方面的分类性能得到了提高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信