{"title":"On Automating XSEDE User Ticket Classification","authors":"Gwang Son, Victor Hazlewood, G. D. Peterson","doi":"10.1145/2616498.2616549","DOIUrl":null,"url":null,"abstract":"The XSEDE ticket system, which is a help desk ticketing system, receives email and web-based problem reports (i.e., tickets) from users and these tickets can be manually grouped into predefined categories either by the ticket submitter or by operations staff. This manual process can be automated by using text classification algorithms such as Multinomial Naive Bayes (MNB) or Softmax Regression Neural Network (SNN). Ticket subjects, rather than whole tickets, were used to make an input word list along with a manual word group list to enhance accuracy. The text mining algorithms used the input word list to select input words in the tickets. Compared with the Matlab svm() function, MNB and SNN showed overall better accuracy (up to ~85.8% using two simultaneous category selection). Also, the service provider resource (i.e., system name) information could be extracted from the tickets with ~90% accuracy.","PeriodicalId":93364,"journal":{"name":"Proceedings of XSEDE16 : Diversity, Big Data, and Science at Scale : July 17-21, 2016, Intercontinental Miami Hotel, Miami, Florida, USA. Conference on Extreme Science and Engineering Discovery Environment (5th : 2016 : Miami, Fla.)","volume":"65 1","pages":"41:1-41:7"},"PeriodicalIF":0.0000,"publicationDate":"2014-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of XSEDE16 : Diversity, Big Data, and Science at Scale : July 17-21, 2016, Intercontinental Miami Hotel, Miami, Florida, USA. Conference on Extreme Science and Engineering Discovery Environment (5th : 2016 : Miami, Fla.)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2616498.2616549","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11
Abstract
The XSEDE ticket system, which is a help desk ticketing system, receives email and web-based problem reports (i.e., tickets) from users and these tickets can be manually grouped into predefined categories either by the ticket submitter or by operations staff. This manual process can be automated by using text classification algorithms such as Multinomial Naive Bayes (MNB) or Softmax Regression Neural Network (SNN). Ticket subjects, rather than whole tickets, were used to make an input word list along with a manual word group list to enhance accuracy. The text mining algorithms used the input word list to select input words in the tickets. Compared with the Matlab svm() function, MNB and SNN showed overall better accuracy (up to ~85.8% using two simultaneous category selection). Also, the service provider resource (i.e., system name) information could be extracted from the tickets with ~90% accuracy.