Mihir Prajapati, Mitul Nakrani, Tarjni Vyas, Lata Gohil, Shivani Desai, S. Degadwala
{"title":"Automatic Question Tagging using Machine Learning and Deep learning Algorithms","authors":"Mihir Prajapati, Mitul Nakrani, Tarjni Vyas, Lata Gohil, Shivani Desai, S. Degadwala","doi":"10.1109/ICECA55336.2022.10009632","DOIUrl":null,"url":null,"abstract":"Stack Overflow is a well-known website which is utilized by nearly everyone who learns to code, share their knowledge and publicly participate in this question-answering forum. The questions posted on the Stack Overflow forum by a user requires a minimum of 1 tag to be manually entered in by them. Tagging most commonly means to associate some single word information about the context of given text or question. Tagging a question is useful in identifying the category that a question or text belongs. It is also beneficial in providing ease of access to a person having a requirement of specific categories of questions. On analysis of tags associated with the questions on the website, it was found that a large number of the questions are labelled by more than one tags, with many of them not being tagged accurately. Due to this situation, it becomes challenging for the users to search for relevant tags. So, the main aim of this research task is to explore methods and compare different techniques in order to create an auto tagging system with the aid of Machine learning and deep learning facilities, accompanied by data preprocessing steps. The dataset for this purpose was taken from Kaggle, known as StackSample dataset, which is a dataset containing 10 percent of the questions present on the website. The output of the research performed for this purpose provided satisfactory results with scope of improvement.","PeriodicalId":356949,"journal":{"name":"2022 6th International Conference on Electronics, Communication and Aerospace Technology","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 6th International Conference on Electronics, Communication and Aerospace Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICECA55336.2022.10009632","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Stack Overflow is a well-known website which is utilized by nearly everyone who learns to code, share their knowledge and publicly participate in this question-answering forum. The questions posted on the Stack Overflow forum by a user requires a minimum of 1 tag to be manually entered in by them. Tagging most commonly means to associate some single word information about the context of given text or question. Tagging a question is useful in identifying the category that a question or text belongs. It is also beneficial in providing ease of access to a person having a requirement of specific categories of questions. On analysis of tags associated with the questions on the website, it was found that a large number of the questions are labelled by more than one tags, with many of them not being tagged accurately. Due to this situation, it becomes challenging for the users to search for relevant tags. So, the main aim of this research task is to explore methods and compare different techniques in order to create an auto tagging system with the aid of Machine learning and deep learning facilities, accompanied by data preprocessing steps. The dataset for this purpose was taken from Kaggle, known as StackSample dataset, which is a dataset containing 10 percent of the questions present on the website. The output of the research performed for this purpose provided satisfactory results with scope of improvement.