基于本体的生物医学文献分类特征加权

Dan He, Xindong Wu
{"title":"基于本体的生物医学文献分类特征加权","authors":"Dan He, Xindong Wu","doi":"10.1109/IRI.2006.252426","DOIUrl":null,"url":null,"abstract":"Ontology-based methods have been applied to biomedical literature classification tasks recently. By mapping lexically different but semantically similar words into features in the domain ontology that underlies the words, we can achieve at least two benefits: the dimensionality of the feature space can be reduced effectively, and the semantic information that underlies the lexical words can be incorporated into the classification process, leading to better classification accuracies. In this paper, we propose an ontology-based feature weighting strategy for the biomedical literature classification problem. We assign weights to the features into which the lexical words are mapped, according to the structure of the domain ontology, and further optimize the weights using cross-validation. Our experiments on MEDLINE-indexed journal abstracts demonstrate that our method can achieve a significant improvement on the classification accuracies, especially when the classification task is hard","PeriodicalId":402255,"journal":{"name":"2006 IEEE International Conference on Information Reuse & Integration","volume":"98 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2006-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":"{\"title\":\"Ontology-Based Feature Weighting for Biomedical Literature Classification\",\"authors\":\"Dan He, Xindong Wu\",\"doi\":\"10.1109/IRI.2006.252426\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Ontology-based methods have been applied to biomedical literature classification tasks recently. By mapping lexically different but semantically similar words into features in the domain ontology that underlies the words, we can achieve at least two benefits: the dimensionality of the feature space can be reduced effectively, and the semantic information that underlies the lexical words can be incorporated into the classification process, leading to better classification accuracies. In this paper, we propose an ontology-based feature weighting strategy for the biomedical literature classification problem. We assign weights to the features into which the lexical words are mapped, according to the structure of the domain ontology, and further optimize the weights using cross-validation. Our experiments on MEDLINE-indexed journal abstracts demonstrate that our method can achieve a significant improvement on the classification accuracies, especially when the classification task is hard\",\"PeriodicalId\":402255,\"journal\":{\"name\":\"2006 IEEE International Conference on Information Reuse & Integration\",\"volume\":\"98 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2006-12-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"11\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2006 IEEE International Conference on Information Reuse & Integration\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IRI.2006.252426\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2006 IEEE International Conference on Information Reuse & Integration","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IRI.2006.252426","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11

摘要

近年来,基于本体的方法已被应用于生物医学文献分类任务中。通过将词法不同但语义相似的词映射到作为词基础的领域本体的特征中,我们至少可以获得两个好处:有效地降低特征空间的维数,将词法词基础的语义信息纳入分类过程,从而提高分类精度。针对生物医学文献分类问题,提出了一种基于本体的特征加权策略。我们根据领域本体的结构为词汇词映射到的特征分配权重,并使用交叉验证进一步优化权重。我们对medline索引期刊摘要的实验表明,我们的方法可以显著提高分类精度,特别是在分类任务较困难的情况下
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Ontology-Based Feature Weighting for Biomedical Literature Classification
Ontology-based methods have been applied to biomedical literature classification tasks recently. By mapping lexically different but semantically similar words into features in the domain ontology that underlies the words, we can achieve at least two benefits: the dimensionality of the feature space can be reduced effectively, and the semantic information that underlies the lexical words can be incorporated into the classification process, leading to better classification accuracies. In this paper, we propose an ontology-based feature weighting strategy for the biomedical literature classification problem. We assign weights to the features into which the lexical words are mapped, according to the structure of the domain ontology, and further optimize the weights using cross-validation. Our experiments on MEDLINE-indexed journal abstracts demonstrate that our method can achieve a significant improvement on the classification accuracies, especially when the classification task is hard
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信