{"title":"An XML database for modern standard Arabic (MSA) verbs generated from triliteral roots","authors":"Youssef Tahir","doi":"10.1109/CIST.2014.7016637","DOIUrl":null,"url":null,"abstract":"In this paper, we present an exhaustive database for Modern Standard Arabic (MSA) verbs generated from trilateral roots. This database is initially represented as a root-pattern matrix listing rows of all recognized roots and columns of all verb patterns in MSA. The intersection of each row and column contains an index indicating the compatibility of the aforementioned root-pattern pair. This index refers also to a list of morpho-syntactic characteristics of the generated verb. We later converted the database into the more flexible XML format. The aim for our approach is twofold: with the objective of building an exhaustive list, we opted for automatic generation of all possible trilateral roots in the Arabic alphabet and subsequent filtering of roots not recognized in the literature; secondly, converting the database into XML creates a highly versatile resource for easy integration in Arabic NLP applications.","PeriodicalId":106483,"journal":{"name":"2014 Third IEEE International Colloquium in Information Science and Technology (CIST)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 Third IEEE International Colloquium in Information Science and Technology (CIST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIST.2014.7016637","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper, we present an exhaustive database for Modern Standard Arabic (MSA) verbs generated from trilateral roots. This database is initially represented as a root-pattern matrix listing rows of all recognized roots and columns of all verb patterns in MSA. The intersection of each row and column contains an index indicating the compatibility of the aforementioned root-pattern pair. This index refers also to a list of morpho-syntactic characteristics of the generated verb. We later converted the database into the more flexible XML format. The aim for our approach is twofold: with the objective of building an exhaustive list, we opted for automatic generation of all possible trilateral roots in the Arabic alphabet and subsequent filtering of roots not recognized in the literature; secondly, converting the database into XML creates a highly versatile resource for easy integration in Arabic NLP applications.