{"title":"基于条件随机场的波斯语问题分类","authors":"A. Mollaei, S. Rahati-Quchani, A. Estaji","doi":"10.1109/ICCKE.2012.6395395","DOIUrl":null,"url":null,"abstract":"The question classification system is one of the important subsystems in the Question Answering Systems (QAS). In such systems through retrieval methods and information extraction the texts are retrieved in order to get to a correct answer. The current study is designed to present the architecture of question classification (QC) in Persian based on the Conditional Random Fields (CRF) machine learning model and evaluate effects of various features on its accuracy. In this study, sentences were classified into two levels of coarse and fine classes based on the type of the answer to each question. After extracting features and setting sliding window on the CRF model, CRF question classifier (QC) is train. Then, the QC predicts labels for every token in question. Next, a majority voting on the question classification output, is used to extract a unique label for each question. Further, the effects of different features on the ultimate accuracy of the system were evaluated. Finally results of this question classifier, illustrate a satisfactory accuracy.","PeriodicalId":154379,"journal":{"name":"2012 2nd International eConference on Computer and Knowledge Engineering (ICCKE)","volume":"56 3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":"{\"title\":\"Question classification in Persian language based on conditional random fields\",\"authors\":\"A. Mollaei, S. Rahati-Quchani, A. Estaji\",\"doi\":\"10.1109/ICCKE.2012.6395395\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The question classification system is one of the important subsystems in the Question Answering Systems (QAS). In such systems through retrieval methods and information extraction the texts are retrieved in order to get to a correct answer. The current study is designed to present the architecture of question classification (QC) in Persian based on the Conditional Random Fields (CRF) machine learning model and evaluate effects of various features on its accuracy. In this study, sentences were classified into two levels of coarse and fine classes based on the type of the answer to each question. After extracting features and setting sliding window on the CRF model, CRF question classifier (QC) is train. Then, the QC predicts labels for every token in question. Next, a majority voting on the question classification output, is used to extract a unique label for each question. Further, the effects of different features on the ultimate accuracy of the system were evaluated. Finally results of this question classifier, illustrate a satisfactory accuracy.\",\"PeriodicalId\":154379,\"journal\":{\"name\":\"2012 2nd International eConference on Computer and Knowledge Engineering (ICCKE)\",\"volume\":\"56 3 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-12-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"11\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 2nd International eConference on Computer and Knowledge Engineering (ICCKE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCKE.2012.6395395\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 2nd International eConference on Computer and Knowledge Engineering (ICCKE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCKE.2012.6395395","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Question classification in Persian language based on conditional random fields
The question classification system is one of the important subsystems in the Question Answering Systems (QAS). In such systems through retrieval methods and information extraction the texts are retrieved in order to get to a correct answer. The current study is designed to present the architecture of question classification (QC) in Persian based on the Conditional Random Fields (CRF) machine learning model and evaluate effects of various features on its accuracy. In this study, sentences were classified into two levels of coarse and fine classes based on the type of the answer to each question. After extracting features and setting sliding window on the CRF model, CRF question classifier (QC) is train. Then, the QC predicts labels for every token in question. Next, a majority voting on the question classification output, is used to extract a unique label for each question. Further, the effects of different features on the ultimate accuracy of the system were evaluated. Finally results of this question classifier, illustrate a satisfactory accuracy.