Question Classification for Helpdesk Support Forum Using Support Vector Machine and Naïve Bayes Algorithm

IF 1.3 Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Noor Aklima Harun, S. Huspi, Noorminshah A. Iahad
{"title":"Question Classification for Helpdesk Support Forum Using Support Vector Machine and Naïve Bayes Algorithm","authors":"Noor Aklima Harun, S. Huspi, Noorminshah A. Iahad","doi":"10.11113/ijic.v13n1.388","DOIUrl":null,"url":null,"abstract":"The helpdesk support system is now essential in ensuring the journey of support services runs more systematically. One of the elements that contribute to the non-uniformity of the question data in the Helpdesk Support System is the diversity of services and users. Most questions asked in the system are in various forms and sentence styles but usually offer the same meaning making its hard for automation of the question classification process. This has led to problems such as the tickets being forwarded to the wrong resolver group, causing the ticket transfer process to take longer response. The key findings in the exploration results revealed that tickets with a high number of transfer transactions take longer to complete than tickets compared to no transfer transaction. Thus, this research aims to develop an automated question classification model for the Helpdesk Support System by applying supervised machine learning methods: Naïve Bayes (NB) and Support Vector Machine (SVM). The domain will use a readily available dataset from the IT Unit. The results using these techniques are then evaluated using confusion matrix and classification report evaluation, including precision, recall, and F1-Measure measurement. The outcomes showed that the SVM algorithm and TF-IDF feature extraction outperformed in terms of accuracy score compared to the NB algorithm. It is expected that this study will have a significant impact on the productivity of team technical and system owners in dealing with the increasing number of comments, feedback, and complaints presented by end-users.","PeriodicalId":50314,"journal":{"name":"International Journal of Innovative Computing Information and Control","volume":"111 1","pages":""},"PeriodicalIF":1.3000,"publicationDate":"2023-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Innovative Computing Information and Control","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.11113/ijic.v13n1.388","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

The helpdesk support system is now essential in ensuring the journey of support services runs more systematically. One of the elements that contribute to the non-uniformity of the question data in the Helpdesk Support System is the diversity of services and users. Most questions asked in the system are in various forms and sentence styles but usually offer the same meaning making its hard for automation of the question classification process. This has led to problems such as the tickets being forwarded to the wrong resolver group, causing the ticket transfer process to take longer response. The key findings in the exploration results revealed that tickets with a high number of transfer transactions take longer to complete than tickets compared to no transfer transaction. Thus, this research aims to develop an automated question classification model for the Helpdesk Support System by applying supervised machine learning methods: Naïve Bayes (NB) and Support Vector Machine (SVM). The domain will use a readily available dataset from the IT Unit. The results using these techniques are then evaluated using confusion matrix and classification report evaluation, including precision, recall, and F1-Measure measurement. The outcomes showed that the SVM algorithm and TF-IDF feature extraction outperformed in terms of accuracy score compared to the NB algorithm. It is expected that this study will have a significant impact on the productivity of team technical and system owners in dealing with the increasing number of comments, feedback, and complaints presented by end-users.
基于支持向量机和Naïve贝叶斯算法的Helpdesk支持论坛问题分类
帮助台支持系统现在对于确保支持服务的旅程更系统地运行至关重要。导致Helpdesk支持系统中问题数据不一致的因素之一是服务和用户的多样性。系统中提出的大多数问题都有不同的形式和句子风格,但通常提供相同的含义,这使得问题分类过程的自动化变得困难。这导致了一些问题,例如将票据转发到错误的解析器组,从而导致票据传输过程需要更长的响应时间。勘探结果的主要发现表明,与没有转让交易的票相比,有大量转让交易的票需要更长的时间才能完成。因此,本研究旨在应用监督机器学习方法:Naïve贝叶斯(NB)和支持向量机(SVM),为Helpdesk支持系统开发一个自动问题分类模型。域将使用来自IT单元的现成可用的数据集。然后使用混淆矩阵和分类报告评估来评估使用这些技术的结果,包括精度、召回率和F1-Measure测量。结果表明,SVM算法和TF-IDF特征提取在准确率得分上优于NB算法。预计这项研究将对团队技术和系统所有者在处理由最终用户提出的越来越多的评论、反馈和投诉方面的生产力产生重大影响。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
3.20
自引率
20.00%
发文量
0
审稿时长
4.3 months
期刊介绍: The primary aim of the International Journal of Innovative Computing, Information and Control (IJICIC) is to publish high-quality papers of new developments and trends, novel techniques and approaches, innovative methodologies and technologies on the theory and applications of intelligent systems, information and control. The IJICIC is a peer-reviewed English language journal and is published bimonthly
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信