Arif Ridho Lubis, S. Prayudani, Y. Fatmi, Okvi Nugroho
{"title":"基于LightGBM的印尼新闻分类","authors":"Arif Ridho Lubis, S. Prayudani, Y. Fatmi, Okvi Nugroho","doi":"10.1109/CENIM56801.2022.10037401","DOIUrl":null,"url":null,"abstract":"There were several news categories that present editors with challenges. Some news categories, such megapolitan, national, celebrity, news, lifestyle, and economics, used vocabulary that was quite similar to that of the other categories. International required an editor to be familiar with the article's contents in order for it to be uploaded and placed in the proper category. We had to first digest the news before we could label it compared to other kinds of data. The text mining approach, which attempts to make text or documents may be processed so that it will aid in the process of news classification, will be used to categorize and determine the type of news in this context. The Light Gradient Boosted Machine (LightGBM) model was used in this study to increase the gradient point with a learning stage and obtain the optimal value. This model's training process was intended to be quick while consuming less storage space and processing information more accurately. The accuracy of the classifications made using a confusion matrix to quantify the findings of this investigation, which were news type classifications, was 86%.","PeriodicalId":118934,"journal":{"name":"2022 International Conference on Computer Engineering, Network, and Intelligent Multimedia (CENIM)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Classifying News Based on Indonesian News Using LightGBM\",\"authors\":\"Arif Ridho Lubis, S. Prayudani, Y. Fatmi, Okvi Nugroho\",\"doi\":\"10.1109/CENIM56801.2022.10037401\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"There were several news categories that present editors with challenges. Some news categories, such megapolitan, national, celebrity, news, lifestyle, and economics, used vocabulary that was quite similar to that of the other categories. International required an editor to be familiar with the article's contents in order for it to be uploaded and placed in the proper category. We had to first digest the news before we could label it compared to other kinds of data. The text mining approach, which attempts to make text or documents may be processed so that it will aid in the process of news classification, will be used to categorize and determine the type of news in this context. The Light Gradient Boosted Machine (LightGBM) model was used in this study to increase the gradient point with a learning stage and obtain the optimal value. This model's training process was intended to be quick while consuming less storage space and processing information more accurately. The accuracy of the classifications made using a confusion matrix to quantify the findings of this investigation, which were news type classifications, was 86%.\",\"PeriodicalId\":118934,\"journal\":{\"name\":\"2022 International Conference on Computer Engineering, Network, and Intelligent Multimedia (CENIM)\",\"volume\":\"31 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference on Computer Engineering, Network, and Intelligent Multimedia (CENIM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CENIM56801.2022.10037401\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Computer Engineering, Network, and Intelligent Multimedia (CENIM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CENIM56801.2022.10037401","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Classifying News Based on Indonesian News Using LightGBM
There were several news categories that present editors with challenges. Some news categories, such megapolitan, national, celebrity, news, lifestyle, and economics, used vocabulary that was quite similar to that of the other categories. International required an editor to be familiar with the article's contents in order for it to be uploaded and placed in the proper category. We had to first digest the news before we could label it compared to other kinds of data. The text mining approach, which attempts to make text or documents may be processed so that it will aid in the process of news classification, will be used to categorize and determine the type of news in this context. The Light Gradient Boosted Machine (LightGBM) model was used in this study to increase the gradient point with a learning stage and obtain the optimal value. This model's training process was intended to be quick while consuming less storage space and processing information more accurately. The accuracy of the classifications made using a confusion matrix to quantify the findings of this investigation, which were news type classifications, was 86%.