基于SVC分类器的新闻标题分类

Goldius Leonard, Fukriandy Sisnadi, Nicholas Vigo Wardhana, Muhammad Abdul Aziz Al-Ghofari, A. S. Girsang
{"title":"基于SVC分类器的新闻标题分类","authors":"Goldius Leonard, Fukriandy Sisnadi, Nicholas Vigo Wardhana, Muhammad Abdul Aziz Al-Ghofari, A. S. Girsang","doi":"10.1109/TSSA56819.2022.10063879","DOIUrl":null,"url":null,"abstract":"News gives new insight and information from all over the world. News has many categories, such as politic, economy, science, and other common news categories. Every news will have their own category based on its content. The classification of news is usually done manually by inputting the category during the news posting. Some of the categories may be inputted incorrectly. The news classifier can be the solution for problem, but the news classifications out there are usually based on the news content. The classifier will receive the word vector inputs that are taken from the news content and try to classify it into one of certain categories. Unfortunately, news contents can be longer and harder to be processed rather than processing the news headline. The news headline is shorter and packs a decent information for the classifier to find out what category it is. Besides the news headline usage, the classifier also needs to be chosen correctly. In this paper, the SVC model will be tested using the news headline data to classify the news and compare with several other models, such as Linear Regression, Multinomial Naive Bayes, Decision Tree, and Random Forest. The common variables to be compared are the accuracy, recall, and precision to evaluate the SVC model.","PeriodicalId":164665,"journal":{"name":"2022 16th International Conference on Telecommunication Systems, Services, and Applications (TSSA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"News Classification Based On News Headline Using SVC Classifier\",\"authors\":\"Goldius Leonard, Fukriandy Sisnadi, Nicholas Vigo Wardhana, Muhammad Abdul Aziz Al-Ghofari, A. S. Girsang\",\"doi\":\"10.1109/TSSA56819.2022.10063879\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"News gives new insight and information from all over the world. News has many categories, such as politic, economy, science, and other common news categories. Every news will have their own category based on its content. The classification of news is usually done manually by inputting the category during the news posting. Some of the categories may be inputted incorrectly. The news classifier can be the solution for problem, but the news classifications out there are usually based on the news content. The classifier will receive the word vector inputs that are taken from the news content and try to classify it into one of certain categories. Unfortunately, news contents can be longer and harder to be processed rather than processing the news headline. The news headline is shorter and packs a decent information for the classifier to find out what category it is. Besides the news headline usage, the classifier also needs to be chosen correctly. In this paper, the SVC model will be tested using the news headline data to classify the news and compare with several other models, such as Linear Regression, Multinomial Naive Bayes, Decision Tree, and Random Forest. The common variables to be compared are the accuracy, recall, and precision to evaluate the SVC model.\",\"PeriodicalId\":164665,\"journal\":{\"name\":\"2022 16th International Conference on Telecommunication Systems, Services, and Applications (TSSA)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 16th International Conference on Telecommunication Systems, Services, and Applications (TSSA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/TSSA56819.2022.10063879\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 16th International Conference on Telecommunication Systems, Services, and Applications (TSSA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TSSA56819.2022.10063879","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

新闻提供来自世界各地的新见解和信息。新闻有许多类别,如政治、经济、科学和其他常见的新闻类别。每条新闻都将根据其内容有自己的分类。新闻的分类通常是通过在新闻发布过程中输入类别来手动完成的。有些类别可能输入不正确。新闻分类器可以解决问题,但现有的新闻分类通常是基于新闻内容的。分类器将接收从新闻内容中提取的词向量输入,并尝试将其分类到某些类别中。不幸的是,与处理新闻标题相比,处理新闻内容可能更长、更困难。新闻标题更短,并且为分类器找出它是什么类别提供了体面的信息。除了新闻标题的使用,分类器的选择也需要正确。本文将使用新闻标题数据来测试SVC模型对新闻进行分类,并与其他几种模型(如线性回归、多项式朴素贝叶斯、决策树和随机森林)进行比较。要比较的常见变量是评估SVC模型的准确性、召回率和精度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
News Classification Based On News Headline Using SVC Classifier
News gives new insight and information from all over the world. News has many categories, such as politic, economy, science, and other common news categories. Every news will have their own category based on its content. The classification of news is usually done manually by inputting the category during the news posting. Some of the categories may be inputted incorrectly. The news classifier can be the solution for problem, but the news classifications out there are usually based on the news content. The classifier will receive the word vector inputs that are taken from the news content and try to classify it into one of certain categories. Unfortunately, news contents can be longer and harder to be processed rather than processing the news headline. The news headline is shorter and packs a decent information for the classifier to find out what category it is. Besides the news headline usage, the classifier also needs to be chosen correctly. In this paper, the SVC model will be tested using the news headline data to classify the news and compare with several other models, such as Linear Regression, Multinomial Naive Bayes, Decision Tree, and Random Forest. The common variables to be compared are the accuracy, recall, and precision to evaluate the SVC model.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信