{"title":"A Topic Based Method to Classify the Question Clarity in CQA Networks","authors":"Alireza Khabbazan, A. A. Abin","doi":"10.1109/IKT54664.2021.9685163","DOIUrl":null,"url":null,"abstract":"Better results would be obtained by distinguishing the clarity of questions as well as increasing their quality. This improvement in clarity may improve the output of a search engine when it encounters the query. Furthermore, it can lead to getting the correct answer to a question when asked in CQAs. In this regard, thousands of different questions are posted daily in CQAs, making these questions and their answers one of the world's most valuable information sources. Nonetheless, most of the questions posted in these forums do not result in proper answers, with one of the most important reasons being a lack of clarity in the questions. This paper addresses one of the most important issues in this field, which is classifying questions based on their clarity. For this purpose, a feature vector based on clustering approaches and obtaining similar questions is designed uniquely for each question based on the total data provided in this field. Following that, the questions are classified based on their clarity using a machine learning classification model. Furthermore, we investigated and reported our new approach using other related approaches in this field in the following step. What we describe as an accomplishment in this paper is the high separability of the questions based on the feature vector extracted by different clusters, which has a much higher performance when compared to other proposed textual classification methods.","PeriodicalId":274571,"journal":{"name":"2021 12th International Conference on Information and Knowledge Technology (IKT)","volume":"114 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 12th International Conference on Information and Knowledge Technology (IKT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IKT54664.2021.9685163","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Better results would be obtained by distinguishing the clarity of questions as well as increasing their quality. This improvement in clarity may improve the output of a search engine when it encounters the query. Furthermore, it can lead to getting the correct answer to a question when asked in CQAs. In this regard, thousands of different questions are posted daily in CQAs, making these questions and their answers one of the world's most valuable information sources. Nonetheless, most of the questions posted in these forums do not result in proper answers, with one of the most important reasons being a lack of clarity in the questions. This paper addresses one of the most important issues in this field, which is classifying questions based on their clarity. For this purpose, a feature vector based on clustering approaches and obtaining similar questions is designed uniquely for each question based on the total data provided in this field. Following that, the questions are classified based on their clarity using a machine learning classification model. Furthermore, we investigated and reported our new approach using other related approaches in this field in the following step. What we describe as an accomplishment in this paper is the high separability of the questions based on the feature vector extracted by different clusters, which has a much higher performance when compared to other proposed textual classification methods.