基于主题的CQA网络问题清晰度分类方法

Alireza Khabbazan, A. A. Abin
{"title":"基于主题的CQA网络问题清晰度分类方法","authors":"Alireza Khabbazan, A. A. Abin","doi":"10.1109/IKT54664.2021.9685163","DOIUrl":null,"url":null,"abstract":"Better results would be obtained by distinguishing the clarity of questions as well as increasing their quality. This improvement in clarity may improve the output of a search engine when it encounters the query. Furthermore, it can lead to getting the correct answer to a question when asked in CQAs. In this regard, thousands of different questions are posted daily in CQAs, making these questions and their answers one of the world's most valuable information sources. Nonetheless, most of the questions posted in these forums do not result in proper answers, with one of the most important reasons being a lack of clarity in the questions. This paper addresses one of the most important issues in this field, which is classifying questions based on their clarity. For this purpose, a feature vector based on clustering approaches and obtaining similar questions is designed uniquely for each question based on the total data provided in this field. Following that, the questions are classified based on their clarity using a machine learning classification model. Furthermore, we investigated and reported our new approach using other related approaches in this field in the following step. What we describe as an accomplishment in this paper is the high separability of the questions based on the feature vector extracted by different clusters, which has a much higher performance when compared to other proposed textual classification methods.","PeriodicalId":274571,"journal":{"name":"2021 12th International Conference on Information and Knowledge Technology (IKT)","volume":"114 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Topic Based Method to Classify the Question Clarity in CQA Networks\",\"authors\":\"Alireza Khabbazan, A. A. Abin\",\"doi\":\"10.1109/IKT54664.2021.9685163\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Better results would be obtained by distinguishing the clarity of questions as well as increasing their quality. This improvement in clarity may improve the output of a search engine when it encounters the query. Furthermore, it can lead to getting the correct answer to a question when asked in CQAs. In this regard, thousands of different questions are posted daily in CQAs, making these questions and their answers one of the world's most valuable information sources. Nonetheless, most of the questions posted in these forums do not result in proper answers, with one of the most important reasons being a lack of clarity in the questions. This paper addresses one of the most important issues in this field, which is classifying questions based on their clarity. For this purpose, a feature vector based on clustering approaches and obtaining similar questions is designed uniquely for each question based on the total data provided in this field. Following that, the questions are classified based on their clarity using a machine learning classification model. Furthermore, we investigated and reported our new approach using other related approaches in this field in the following step. What we describe as an accomplishment in this paper is the high separability of the questions based on the feature vector extracted by different clusters, which has a much higher performance when compared to other proposed textual classification methods.\",\"PeriodicalId\":274571,\"journal\":{\"name\":\"2021 12th International Conference on Information and Knowledge Technology (IKT)\",\"volume\":\"114 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 12th International Conference on Information and Knowledge Technology (IKT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IKT54664.2021.9685163\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 12th International Conference on Information and Knowledge Technology (IKT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IKT54664.2021.9685163","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

通过区分问题的清晰度和提高问题的质量,将获得更好的结果。这种清晰度的提高可能会改善搜索引擎在遇到查询时的输出。此外,当在cqa中提出问题时,它可以导致获得正确的答案。在这方面,cqa每天发布数千个不同的问题,使这些问题及其答案成为世界上最有价值的信息来源之一。尽管如此,在这些论坛上发布的大多数问题都没有得到正确的答案,其中最重要的原因之一是问题缺乏清晰度。本文解决了这一领域中最重要的问题之一,即基于清晰度的问题分类。为此,根据该领域提供的总数据,为每个问题设计了基于聚类方法和获取相似问题的特征向量。然后,使用机器学习分类模型根据问题的清晰度对其进行分类。此外,在接下来的步骤中,我们利用该领域的其他相关方法对我们的新方法进行了调查和报道。本文的一个成就是基于不同聚类提取的特征向量对问题进行了高可分性,与其他提出的文本分类方法相比,具有更高的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Topic Based Method to Classify the Question Clarity in CQA Networks
Better results would be obtained by distinguishing the clarity of questions as well as increasing their quality. This improvement in clarity may improve the output of a search engine when it encounters the query. Furthermore, it can lead to getting the correct answer to a question when asked in CQAs. In this regard, thousands of different questions are posted daily in CQAs, making these questions and their answers one of the world's most valuable information sources. Nonetheless, most of the questions posted in these forums do not result in proper answers, with one of the most important reasons being a lack of clarity in the questions. This paper addresses one of the most important issues in this field, which is classifying questions based on their clarity. For this purpose, a feature vector based on clustering approaches and obtaining similar questions is designed uniquely for each question based on the total data provided in this field. Following that, the questions are classified based on their clarity using a machine learning classification model. Furthermore, we investigated and reported our new approach using other related approaches in this field in the following step. What we describe as an accomplishment in this paper is the high separability of the questions based on the feature vector extracted by different clusters, which has a much higher performance when compared to other proposed textual classification methods.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信