Improving the Arabic root extraction by using the quadratic splines
Authors: Mohamed Boudchiche, A. Mazroui
Published in: 2018 International Conference on Intelligent Systems and Computer Vision (ISCV)
Publication date: 2018-04-02
DOI: 10.1109/ISACV.2018.8354062 (https://doi.org/10.1109/ISACV.2018.8354062)
Citation count: 6
Abstract
In this paper, we present an Arabic root extraction system that provides the root of each word in a given sentence. Root extraction is an indispensable tool for several natural language processing applications, such as search engines, text classification, and information retrieval. The extraction method used in this work runs in two steps. The first step finds all possible roots of each word, analyzed out of context, using the morphological analyzer Alkhalil Morpho Sys 2. The second step applies a disambiguation approach based on continuous quadratic splines to choose, among these candidate roots, the one that fits the word's context. We tested this method on a representative corpus and obtained encouraging results, with an accuracy of about 96%.
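
The two-step structure described in the abstract (generate candidate roots out of context, then pick one per word using context) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the `CANDIDATES` table is a toy stand-in for the output of a morphological analyzer and does not reproduce Alkhalil Morpho Sys 2, the words are rough transliterations, and `toy_score` is a simple pair-count scorer swapped in for the paper's quadratic-spline disambiguation, whose details the abstract does not give.

```python
from typing import Callable, Dict, List, Optional, Sequence, Tuple

# Step 1 stand-in (hypothetical): candidate roots per word, as an
# out-of-context analyzer might list them. Toy data, not real analyzer output.
CANDIDATES: Dict[str, List[str]] = {
    "ktb": ["ktb"],
    "Alwld": ["wld"],
    "drs": ["drs", "drr"],  # ambiguous word with two candidate roots
}

# Step 2 stand-in (hypothetical): counts of adjacent root pairs, used as a
# context score. This replaces the spline-based model with something simpler.
PAIR_COUNTS: Dict[Tuple[Optional[str], str], int] = {
    (None, "ktb"): 3,
    ("ktb", "wld"): 2,
    ("wld", "drs"): 5,
    ("wld", "drr"): 1,
}

Scorer = Callable[[Optional[str], str], float]


def candidate_roots(word: str, lexicon: Dict[str, List[str]]) -> List[str]:
    """Return all candidate roots for a word, ignoring context."""
    # Assumption: an unknown word falls back to itself as its only candidate.
    return lexicon.get(word, [word])


def toy_score(prev_root: Optional[str], root: str) -> float:
    """Score a candidate root given the root chosen for the previous word."""
    return float(PAIR_COUNTS.get((prev_root, root), 0))


def extract_roots(
    words: Sequence[str], lexicon: Dict[str, List[str]], score: Scorer
) -> List[str]:
    """Greedy left-to-right choice of one root per word."""
    chosen: List[str] = []
    prev: Optional[str] = None
    for word in words:
        options = candidate_roots(word, lexicon)
        # Keep the candidate that scores highest in the current context.
        best = max(options, key=lambda r: score(prev, r))
        chosen.append(best)
        prev = best
    return chosen


sentence = ["ktb", "Alwld", "drs"]
print(extract_roots(sentence, CANDIDATES, toy_score))  # ['ktb', 'wld', 'drs']
```

Passing the scorer as a parameter keeps candidate generation and disambiguation separate, so a different context model could be substituted without touching step 1.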