Clarissa Faye G. Gamboa, Matthew B. Concepcion, Antolin J. Alipio, Dan Michael A. Cortez, Andrew G. Bitancor, M. S. Santos, F. A. L. Atienza, M. A. S. Mercado
{"title":"基于聚类的KNN算法在IT支持票务路由中的进一步改进","authors":"Clarissa Faye G. Gamboa, Matthew B. Concepcion, Antolin J. Alipio, Dan Michael A. Cortez, Andrew G. Bitancor, M. S. Santos, F. A. L. Atienza, M. A. S. Mercado","doi":"10.1109/cniot55862.2022.00040","DOIUrl":null,"url":null,"abstract":"Companies receive millions of tickets from their clients. Unfortunately, manual ticket routing takes time and relies heavily on human resources. To help automate the ticket routing, text classification can assist as it is the process of categorizing a document into a predetermined class based on its content. One algorithm is the K-Nearest Neighbors (KNN) which is a popular supervised technique but ranks average to lowest compared to other classification models. An improved KNN algorithm utilized clustering and improved the accuracy of the classifier. This paper proposed a further enhancement of this algorithm by adding preprocessing techniques, changing the distance formula, and computing for the k-value rather than choosing one. Two datasets of IT support tickets were used to train and test the algorithms. Results showed that this further enhanced algorithm significantly performed better than the initial algorithm with the highest accuracy score of 97.83% in one dataset while the initial algorithm performed best with an accuracy score of 86.34% using a k-value of 4 in another dataset.","PeriodicalId":251734,"journal":{"name":"2022 3rd International Conference on Computing, Networks and Internet of Things (CNIOT)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Further Enhancement of KNN Algorithm Based on Clustering Applied to IT Support Ticket Routing\",\"authors\":\"Clarissa Faye G. Gamboa, Matthew B. Concepcion, Antolin J. Alipio, Dan Michael A. Cortez, Andrew G. Bitancor, M. S. Santos, F. A. L. Atienza, M. A. S. Mercado\",\"doi\":\"10.1109/cniot55862.2022.00040\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Companies receive millions of tickets from their clients. Unfortunately, manual ticket routing takes time and relies heavily on human resources. To help automate the ticket routing, text classification can assist as it is the process of categorizing a document into a predetermined class based on its content. One algorithm is the K-Nearest Neighbors (KNN) which is a popular supervised technique but ranks average to lowest compared to other classification models. An improved KNN algorithm utilized clustering and improved the accuracy of the classifier. This paper proposed a further enhancement of this algorithm by adding preprocessing techniques, changing the distance formula, and computing for the k-value rather than choosing one. Two datasets of IT support tickets were used to train and test the algorithms. Results showed that this further enhanced algorithm significantly performed better than the initial algorithm with the highest accuracy score of 97.83% in one dataset while the initial algorithm performed best with an accuracy score of 86.34% using a k-value of 4 in another dataset.\",\"PeriodicalId\":251734,\"journal\":{\"name\":\"2022 3rd International Conference on Computing, Networks and Internet of Things (CNIOT)\",\"volume\":\"11 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 3rd International Conference on Computing, Networks and Internet of Things (CNIOT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/cniot55862.2022.00040\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 3rd International Conference on Computing, Networks and Internet of Things (CNIOT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/cniot55862.2022.00040","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Further Enhancement of KNN Algorithm Based on Clustering Applied to IT Support Ticket Routing
Companies receive millions of tickets from their clients. Unfortunately, manual ticket routing takes time and relies heavily on human resources. To help automate the ticket routing, text classification can assist as it is the process of categorizing a document into a predetermined class based on its content. One algorithm is the K-Nearest Neighbors (KNN) which is a popular supervised technique but ranks average to lowest compared to other classification models. An improved KNN algorithm utilized clustering and improved the accuracy of the classifier. This paper proposed a further enhancement of this algorithm by adding preprocessing techniques, changing the distance formula, and computing for the k-value rather than choosing one. Two datasets of IT support tickets were used to train and test the algorithms. Results showed that this further enhanced algorithm significantly performed better than the initial algorithm with the highest accuracy score of 97.83% in one dataset while the initial algorithm performed best with an accuracy score of 86.34% using a k-value of 4 in another dataset.