{"title":"对提取的阿拉伯语候选词的过滤建议","authors":"Imen Bouaziz Mezghanni, F. Gargouri","doi":"10.1109/ICTA.2015.7426927","DOIUrl":null,"url":null,"abstract":"In the terminology extraction process, determining relevance of the candidates is very crucial for the purpose of identifying domain-relevant terms. Information about terms can be often gathered from linguistic knowledge or from statistic measures. In this paper, we present a proposition of a filtering mechanism based on a machine learning technique so as to keep only the most relevant terms. The proposed strategy incorporates varied and rich features from the content as well as the structure of Arabic legal documents.","PeriodicalId":375443,"journal":{"name":"2015 5th International Conference on Information & Communication Technology and Accessibility (ICTA)","volume":"69 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A filtering proposal for extracted Arabic term candidates\",\"authors\":\"Imen Bouaziz Mezghanni, F. Gargouri\",\"doi\":\"10.1109/ICTA.2015.7426927\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the terminology extraction process, determining relevance of the candidates is very crucial for the purpose of identifying domain-relevant terms. Information about terms can be often gathered from linguistic knowledge or from statistic measures. In this paper, we present a proposition of a filtering mechanism based on a machine learning technique so as to keep only the most relevant terms. The proposed strategy incorporates varied and rich features from the content as well as the structure of Arabic legal documents.\",\"PeriodicalId\":375443,\"journal\":{\"name\":\"2015 5th International Conference on Information & Communication Technology and Accessibility (ICTA)\",\"volume\":\"69 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 5th International Conference on Information & Communication Technology and Accessibility (ICTA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICTA.2015.7426927\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 5th International Conference on Information & Communication Technology and Accessibility (ICTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTA.2015.7426927","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A filtering proposal for extracted Arabic term candidates
In the terminology extraction process, determining relevance of the candidates is very crucial for the purpose of identifying domain-relevant terms. Information about terms can be often gathered from linguistic knowledge or from statistic measures. In this paper, we present a proposition of a filtering mechanism based on a machine learning technique so as to keep only the most relevant terms. The proposed strategy incorporates varied and rich features from the content as well as the structure of Arabic legal documents.