邻域影响在文本分类中的探讨

2012 Fourth International Conference on Knowledge and Systems Engineering Pub Date : 2012-08-17 DOI:10.1109/KSE.2012.35

N. Le, T. Tran, M. Tran

{"title":"邻域影响在文本分类中的探讨","authors":"N. Le, T. Tran, M. Tran","doi":"10.1109/KSE.2012.35","DOIUrl":null,"url":null,"abstract":"Standard supervised learning approaches have been widely applied on the text classification problem. These standard approaches exploit only the local content of the document. However, the additional information in the relationship between the items can be used to improve the overall accuracy of the classification process. To make use of this information, the authors propose a statistical model to capture both the contents and labels from each link the neighborhood. This link model is then incorporated with the Markov Random Field model to form the soft labeling model for text classification. This new approach has combined both the local content and the influence from the neighborhood. The results of soft labeling model on standard data sets are also promising. Moreover, the new model can be applied on not only the text classification problem but also many kinds of richly structured data sets.","PeriodicalId":122680,"journal":{"name":"2012 Fourth International Conference on Knowledge and Systems Engineering","volume":"77 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Exploring Neighborhood Influence in Text Classification\",\"authors\":\"N. Le, T. Tran, M. Tran\",\"doi\":\"10.1109/KSE.2012.35\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Standard supervised learning approaches have been widely applied on the text classification problem. These standard approaches exploit only the local content of the document. However, the additional information in the relationship between the items can be used to improve the overall accuracy of the classification process. To make use of this information, the authors propose a statistical model to capture both the contents and labels from each link the neighborhood. This link model is then incorporated with the Markov Random Field model to form the soft labeling model for text classification. This new approach has combined both the local content and the influence from the neighborhood. The results of soft labeling model on standard data sets are also promising. Moreover, the new model can be applied on not only the text classification problem but also many kinds of richly structured data sets.\",\"PeriodicalId\":122680,\"journal\":{\"name\":\"2012 Fourth International Conference on Knowledge and Systems Engineering\",\"volume\":\"77 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-08-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 Fourth International Conference on Knowledge and Systems Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/KSE.2012.35\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 Fourth International Conference on Knowledge and Systems Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/KSE.2012.35","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

标准监督学习方法在文本分类问题上得到了广泛的应用。这些标准方法仅利用文档的本地内容。然而，项目之间关系中的附加信息可用于提高分类过程的总体准确性。为了利用这些信息，作者提出了一个统计模型来捕获邻居的每个链接的内容和标签。然后将该链接模型与马尔可夫随机场模型结合，形成文本分类的软标注模型。这种新方法结合了本地内容和社区的影响。软标注模型在标准数据集上的效果也很好。此外，该模型不仅可以应用于文本分类问题，还可以应用于多种丰富结构的数据集。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Exploring Neighborhood Influence in Text Classification

Standard supervised learning approaches have been widely applied on the text classification problem. These standard approaches exploit only the local content of the document. However, the additional information in the relationship between the items can be used to improve the overall accuracy of the classification process. To make use of this information, the authors propose a statistical model to capture both the contents and labels from each link the neighborhood. This link model is then incorporated with the Markov Random Field model to form the soft labeling model for text classification. This new approach has combined both the local content and the influence from the neighborhood. The results of soft labeling model on standard data sets are also promising. Moreover, the new model can be applied on not only the text classification problem but also many kinds of richly structured data sets.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2012 Fourth International Conference on Knowledge and Systems Engineering

自引率

0.00%

发文量