利用多标签分类方法对经济现象进行分类

Q4 Environmental Science

Asean Journal on Science and Technology for Development Pub Date : 2021-08-30 DOI:10.29037/ajstd.680

Nofriani, N. Kurniawan

{"title":"利用多标签分类方法对经济现象进行分类","authors":"Nofriani, N. Kurniawan","doi":"10.29037/ajstd.680","DOIUrl":null,"url":null,"abstract":"One fashion to report a country’s economic state is by compiling economic phenomena from several sources. The collected data may be explored based on their sentiments and economic categories. This research attempted to perform and analyze multiple approaches to multi-label text classification in addition to providing sentiment analysis on the economic phenomena. The sentiment and single-label category classification was performed utilizing the logistic regression model. Meanwhile, the multi-label category classification was fulfilled using a combination of logistic regression, support vector machines, k-nearest neighbor, naïve Bayes, and decision trees as base classifiers, with binary relevance, classifier chain, and label power set as the implementation approaches. The results showed that logistic regression works well in sentiment and single-label classification, with a classification accuracy of 80.08% and 92.71%, respectively. However, it was also discovered that it works poorly as a base classifier in multi-label classification, indicated by the classification accuracy dropping to 13.35%, 15.40%, and 30.65% for binary relevance, classifier chain, and label power set, respectively. Alternatively, naïve Bayes works best as a base classifier in the label power set approach for multi-label classification, with a classification accuracy of 63.22%, followed by decision trees and support vector machines.","PeriodicalId":8479,"journal":{"name":"Asean Journal on Science and Technology for Development","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Harnessing Multi-label Classification Approaches for Economic Phenomena Categorization\",\"authors\":\"Nofriani, N. Kurniawan\",\"doi\":\"10.29037/ajstd.680\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"One fashion to report a country’s economic state is by compiling economic phenomena from several sources. The collected data may be explored based on their sentiments and economic categories. This research attempted to perform and analyze multiple approaches to multi-label text classification in addition to providing sentiment analysis on the economic phenomena. The sentiment and single-label category classification was performed utilizing the logistic regression model. Meanwhile, the multi-label category classification was fulfilled using a combination of logistic regression, support vector machines, k-nearest neighbor, naïve Bayes, and decision trees as base classifiers, with binary relevance, classifier chain, and label power set as the implementation approaches. The results showed that logistic regression works well in sentiment and single-label classification, with a classification accuracy of 80.08% and 92.71%, respectively. However, it was also discovered that it works poorly as a base classifier in multi-label classification, indicated by the classification accuracy dropping to 13.35%, 15.40%, and 30.65% for binary relevance, classifier chain, and label power set, respectively. Alternatively, naïve Bayes works best as a base classifier in the label power set approach for multi-label classification, with a classification accuracy of 63.22%, followed by decision trees and support vector machines.\",\"PeriodicalId\":8479,\"journal\":{\"name\":\"Asean Journal on Science and Technology for Development\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-08-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Asean Journal on Science and Technology for Development\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.29037/ajstd.680\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"Environmental Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Asean Journal on Science and Technology for Development","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.29037/ajstd.680","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Environmental Science","Score":null,"Total":0}

引用次数: 1

摘要

报道一个国家经济状况的一种方式是汇总来自多个来源的经济现象。收集到的数据可以根据他们的情绪和经济类别进行探索。本研究除了提供对经济现象的情感分析外，还尝试执行和分析多标签文本分类的多种方法。使用逻辑回归模型进行情感和单标签类别分类。同时，采用逻辑回归、支持向量机、k近邻、naïve贝叶斯和决策树作为基本分类器，以二值关联、分类器链和标签功率集为实现方法，实现多标签类别分类。结果表明，逻辑回归在情感分类和单标签分类中效果良好，分类准确率分别为80.08%和92.71%。然而，我们也发现它作为基础分类器在多标签分类中表现不佳，在二值相关性、分类器链和标签功率集上的分类准确率分别降至13.35%、15.40%和30.65%。另外，naïve贝叶斯作为基础分类器在标签功率集方法中表现最好，分类准确率为63.22%，其次是决策树和支持向量机。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Harnessing Multi-label Classification Approaches for Economic Phenomena Categorization

One fashion to report a country’s economic state is by compiling economic phenomena from several sources. The collected data may be explored based on their sentiments and economic categories. This research attempted to perform and analyze multiple approaches to multi-label text classification in addition to providing sentiment analysis on the economic phenomena. The sentiment and single-label category classification was performed utilizing the logistic regression model. Meanwhile, the multi-label category classification was fulfilled using a combination of logistic regression, support vector machines, k-nearest neighbor, naïve Bayes, and decision trees as base classifiers, with binary relevance, classifier chain, and label power set as the implementation approaches. The results showed that logistic regression works well in sentiment and single-label classification, with a classification accuracy of 80.08% and 92.71%, respectively. However, it was also discovered that it works poorly as a base classifier in multi-label classification, indicated by the classification accuracy dropping to 13.35%, 15.40%, and 30.65% for binary relevance, classifier chain, and label power set, respectively. Alternatively, naïve Bayes works best as a base classifier in the label power set approach for multi-label classification, with a classification accuracy of 63.22%, followed by decision trees and support vector machines.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Asean Journal on Science and Technology for Development Environmental Science-Waste Management and Disposal

CiteScore

1.50

自引率

0.00%

发文量

审稿时长

14 weeks