{"title":"Performance Analysis of Machine Learning and Deep Learning Techniques for Answer Type extraction of Marathi Questions","authors":"Dhiraj Amin, S. Govilkar, Sagar Kulkarni","doi":"10.1109/ICNTE56631.2023.10146625","DOIUrl":null,"url":null,"abstract":"Question answering systems involve extraction of correct answers for natural language questions provided as input. Question classification is an important part of the question processing phase where natural language questions are categorized into predefined classes which specifies expected type of answer for the question. Extraction of answer type can be performed using machine learning and deep learning classification techniques which helps to reduce the list of possible correct answers for a question. In this paper we have compared various classification techniques which can be used for building a Marathi question classification system. Additionally, we have created a Marathi question classification dataset by translating existing TREC dataset available in English language. We observed that fine tuning the RoBERTa based monolingual language model for question classification was the best classification technique with accuracy of 91% in the coarse grained category of question classification and accuracy of 85% in the fine grained category of question classification.","PeriodicalId":158124,"journal":{"name":"2023 5th Biennial International Conference on Nascent Technologies in Engineering (ICNTE)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 5th Biennial International Conference on Nascent Technologies in Engineering (ICNTE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICNTE56631.2023.10146625","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Question answering systems involve extraction of correct answers for natural language questions provided as input. Question classification is an important part of the question processing phase where natural language questions are categorized into predefined classes which specifies expected type of answer for the question. Extraction of answer type can be performed using machine learning and deep learning classification techniques which helps to reduce the list of possible correct answers for a question. In this paper we have compared various classification techniques which can be used for building a Marathi question classification system. Additionally, we have created a Marathi question classification dataset by translating existing TREC dataset available in English language. We observed that fine tuning the RoBERTa based monolingual language model for question classification was the best classification technique with accuracy of 91% in the coarse grained category of question classification and accuracy of 85% in the fine grained category of question classification.