基于聚类的KNN算法在IT支持票务路由中的进一步改进

Clarissa Faye G. Gamboa, Matthew B. Concepcion, Antolin J. Alipio, Dan Michael A. Cortez, Andrew G. Bitancor, M. S. Santos, F. A. L. Atienza, M. A. S. Mercado
{"title":"基于聚类的KNN算法在IT支持票务路由中的进一步改进","authors":"Clarissa Faye G. Gamboa, Matthew B. Concepcion, Antolin J. Alipio, Dan Michael A. Cortez, Andrew G. Bitancor, M. S. Santos, F. A. L. Atienza, M. A. S. Mercado","doi":"10.1109/cniot55862.2022.00040","DOIUrl":null,"url":null,"abstract":"Companies receive millions of tickets from their clients. Unfortunately, manual ticket routing takes time and relies heavily on human resources. To help automate the ticket routing, text classification can assist as it is the process of categorizing a document into a predetermined class based on its content. One algorithm is the K-Nearest Neighbors (KNN) which is a popular supervised technique but ranks average to lowest compared to other classification models. An improved KNN algorithm utilized clustering and improved the accuracy of the classifier. This paper proposed a further enhancement of this algorithm by adding preprocessing techniques, changing the distance formula, and computing for the k-value rather than choosing one. Two datasets of IT support tickets were used to train and test the algorithms. Results showed that this further enhanced algorithm significantly performed better than the initial algorithm with the highest accuracy score of 97.83% in one dataset while the initial algorithm performed best with an accuracy score of 86.34% using a k-value of 4 in another dataset.","PeriodicalId":251734,"journal":{"name":"2022 3rd International Conference on Computing, Networks and Internet of Things (CNIOT)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Further Enhancement of KNN Algorithm Based on Clustering Applied to IT Support Ticket Routing\",\"authors\":\"Clarissa Faye G. Gamboa, Matthew B. Concepcion, Antolin J. Alipio, Dan Michael A. Cortez, Andrew G. Bitancor, M. S. Santos, F. A. L. Atienza, M. A. S. Mercado\",\"doi\":\"10.1109/cniot55862.2022.00040\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Companies receive millions of tickets from their clients. Unfortunately, manual ticket routing takes time and relies heavily on human resources. To help automate the ticket routing, text classification can assist as it is the process of categorizing a document into a predetermined class based on its content. One algorithm is the K-Nearest Neighbors (KNN) which is a popular supervised technique but ranks average to lowest compared to other classification models. An improved KNN algorithm utilized clustering and improved the accuracy of the classifier. This paper proposed a further enhancement of this algorithm by adding preprocessing techniques, changing the distance formula, and computing for the k-value rather than choosing one. Two datasets of IT support tickets were used to train and test the algorithms. Results showed that this further enhanced algorithm significantly performed better than the initial algorithm with the highest accuracy score of 97.83% in one dataset while the initial algorithm performed best with an accuracy score of 86.34% using a k-value of 4 in another dataset.\",\"PeriodicalId\":251734,\"journal\":{\"name\":\"2022 3rd International Conference on Computing, Networks and Internet of Things (CNIOT)\",\"volume\":\"11 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 3rd International Conference on Computing, Networks and Internet of Things (CNIOT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/cniot55862.2022.00040\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 3rd International Conference on Computing, Networks and Internet of Things (CNIOT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/cniot55862.2022.00040","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

公司从客户那里收到数百万张门票。不幸的是,手动票务路由需要时间,并且严重依赖人力资源。为了帮助自动化票据路由,文本分类可以提供帮助,因为它是根据文档的内容将文档分类为预定的类的过程。一种算法是k近邻(KNN),这是一种流行的监督技术,但与其他分类模型相比,它的排名从平均到最低。改进的KNN算法利用聚类,提高了分类器的准确率。本文通过增加预处理技术,改变距离公式,计算k值而不是选择k值,对该算法进行了进一步的改进。使用两个IT支持票数据集对算法进行训练和测试。结果表明,进一步增强后的算法在一个数据集上的准确率最高为97.83%,明显优于初始算法;在另一个数据集上,k值为4时,初始算法的准确率最高为86.34%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Further Enhancement of KNN Algorithm Based on Clustering Applied to IT Support Ticket Routing
Companies receive millions of tickets from their clients. Unfortunately, manual ticket routing takes time and relies heavily on human resources. To help automate the ticket routing, text classification can assist as it is the process of categorizing a document into a predetermined class based on its content. One algorithm is the K-Nearest Neighbors (KNN) which is a popular supervised technique but ranks average to lowest compared to other classification models. An improved KNN algorithm utilized clustering and improved the accuracy of the classifier. This paper proposed a further enhancement of this algorithm by adding preprocessing techniques, changing the distance formula, and computing for the k-value rather than choosing one. Two datasets of IT support tickets were used to train and test the algorithms. Results showed that this further enhanced algorithm significantly performed better than the initial algorithm with the highest accuracy score of 97.83% in one dataset while the initial algorithm performed best with an accuracy score of 86.34% using a k-value of 4 in another dataset.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信