{"title":"Question identification and classification on an academic question answering site","authors":"B. Ojokoh, Tobore Igbe, A. Araoye, Friday Ameh","doi":"10.1145/2910896.2925442","DOIUrl":null,"url":null,"abstract":"Online communities such as wikis, blogs, forums, scientific communities and other social networking services have enabled new levels of interactions and interconnections among individuals, documents and data and have become places for people to seek and share expertise. In this paper, we propose a systematic approach to identification and classification of questions. The questions were first identified using semantic occurrence of Part of Speech (POS) tag in English Language, after which they were classified based on maximum probability value of Naïve Bayes classification. The model was validated and evaluated with experiments on some crawled web pages from ResearchGate.","PeriodicalId":109613,"journal":{"name":"2016 IEEE/ACM Joint Conference on Digital Libraries (JCDL)","volume":"60 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE/ACM Joint Conference on Digital Libraries (JCDL)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2910896.2925442","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10
Abstract
Online communities such as wikis, blogs, forums, scientific communities and other social networking services have enabled new levels of interactions and interconnections among individuals, documents and data and have become places for people to seek and share expertise. In this paper, we propose a systematic approach to identification and classification of questions. The questions were first identified using semantic occurrence of Part of Speech (POS) tag in English Language, after which they were classified based on maximum probability value of Naïve Bayes classification. The model was validated and evaluated with experiments on some crawled web pages from ResearchGate.