利用多标签分类方法对经济现象进行分类

Q4 Environmental Science
Nofriani, N. Kurniawan
{"title":"利用多标签分类方法对经济现象进行分类","authors":"Nofriani, N. Kurniawan","doi":"10.29037/ajstd.680","DOIUrl":null,"url":null,"abstract":"One fashion to report a country’s economic state is by compiling economic phenomena from several sources. The collected data may be explored based on their sentiments and economic categories. This research attempted to perform and analyze multiple approaches to multi-label text classification in addition to providing sentiment analysis on the economic phenomena. The sentiment and single-label category classification was performed utilizing the logistic regression model. Meanwhile, the multi-label category classification was fulfilled using a combination of logistic regression, support vector machines, k-nearest neighbor, naïve Bayes, and decision trees as base classifiers, with binary relevance, classifier chain, and label power set as the implementation approaches. The results showed that logistic regression works well in sentiment and single-label classification, with a classification accuracy of 80.08% and 92.71%, respectively. However, it was also discovered that it works poorly as a base classifier in multi-label classification, indicated by the classification accuracy dropping to 13.35%, 15.40%, and 30.65% for binary relevance, classifier chain, and label power set, respectively. Alternatively, naïve Bayes works best as a base classifier in the label power set approach for multi-label classification, with a classification accuracy of 63.22%, followed by decision trees and support vector machines.","PeriodicalId":8479,"journal":{"name":"Asean Journal on Science and Technology for Development","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2021-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Harnessing Multi-label Classification Approaches for Economic Phenomena Categorization\",\"authors\":\"Nofriani, N. Kurniawan\",\"doi\":\"10.29037/ajstd.680\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"One fashion to report a country’s economic state is by compiling economic phenomena from several sources. The collected data may be explored based on their sentiments and economic categories. This research attempted to perform and analyze multiple approaches to multi-label text classification in addition to providing sentiment analysis on the economic phenomena. The sentiment and single-label category classification was performed utilizing the logistic regression model. Meanwhile, the multi-label category classification was fulfilled using a combination of logistic regression, support vector machines, k-nearest neighbor, naïve Bayes, and decision trees as base classifiers, with binary relevance, classifier chain, and label power set as the implementation approaches. The results showed that logistic regression works well in sentiment and single-label classification, with a classification accuracy of 80.08% and 92.71%, respectively. However, it was also discovered that it works poorly as a base classifier in multi-label classification, indicated by the classification accuracy dropping to 13.35%, 15.40%, and 30.65% for binary relevance, classifier chain, and label power set, respectively. Alternatively, naïve Bayes works best as a base classifier in the label power set approach for multi-label classification, with a classification accuracy of 63.22%, followed by decision trees and support vector machines.\",\"PeriodicalId\":8479,\"journal\":{\"name\":\"Asean Journal on Science and Technology for Development\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-08-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Asean Journal on Science and Technology for Development\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.29037/ajstd.680\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"Environmental Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Asean Journal on Science and Technology for Development","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.29037/ajstd.680","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Environmental Science","Score":null,"Total":0}
引用次数: 1

摘要

报道一个国家经济状况的一种方式是汇总来自多个来源的经济现象。收集到的数据可以根据他们的情绪和经济类别进行探索。本研究除了提供对经济现象的情感分析外,还尝试执行和分析多标签文本分类的多种方法。使用逻辑回归模型进行情感和单标签类别分类。同时,采用逻辑回归、支持向量机、k近邻、naïve贝叶斯和决策树作为基本分类器,以二值关联、分类器链和标签功率集为实现方法,实现多标签类别分类。结果表明,逻辑回归在情感分类和单标签分类中效果良好,分类准确率分别为80.08%和92.71%。然而,我们也发现它作为基础分类器在多标签分类中表现不佳,在二值相关性、分类器链和标签功率集上的分类准确率分别降至13.35%、15.40%和30.65%。另外,naïve贝叶斯作为基础分类器在标签功率集方法中表现最好,分类准确率为63.22%,其次是决策树和支持向量机。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Harnessing Multi-label Classification Approaches for Economic Phenomena Categorization
One fashion to report a country’s economic state is by compiling economic phenomena from several sources. The collected data may be explored based on their sentiments and economic categories. This research attempted to perform and analyze multiple approaches to multi-label text classification in addition to providing sentiment analysis on the economic phenomena. The sentiment and single-label category classification was performed utilizing the logistic regression model. Meanwhile, the multi-label category classification was fulfilled using a combination of logistic regression, support vector machines, k-nearest neighbor, naïve Bayes, and decision trees as base classifiers, with binary relevance, classifier chain, and label power set as the implementation approaches. The results showed that logistic regression works well in sentiment and single-label classification, with a classification accuracy of 80.08% and 92.71%, respectively. However, it was also discovered that it works poorly as a base classifier in multi-label classification, indicated by the classification accuracy dropping to 13.35%, 15.40%, and 30.65% for binary relevance, classifier chain, and label power set, respectively. Alternatively, naïve Bayes works best as a base classifier in the label power set approach for multi-label classification, with a classification accuracy of 63.22%, followed by decision trees and support vector machines.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Asean Journal on Science and Technology for Development
Asean Journal on Science and Technology for Development Environmental Science-Waste Management and Disposal
CiteScore
1.50
自引率
0.00%
发文量
10
审稿时长
14 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信