{"title":"基于特征的菲文贴贴词随机形态分析仪","authors":"G. A. Ong, Melvin A. Ballera","doi":"10.1109/IICAIET55139.2022.9936850","DOIUrl":null,"url":null,"abstract":"This paper papers presents a featured-based stochastic stemming methods for obtaining affixes in Filipino language. The method aims to introduce a statistical stemming approach that is based on the morphological attributes of Filipino words. Various Filipino word forms from different types of sources were obtained and test for affix removal system. The stemmer initially performs lexicon check from the created lexis which is comprises of common based words and various categorical language forms. Feature examinations are executed to check the data entry's structure. These includes affix removal, word assimilation, partial duplication, derivational words, and inflectional words. A KSTEM assimilatory method from Hybrid Stemming Algorithm are utilized to support derivational and inflectional conditions. From the created stochastic featured-based template algorithm, the entries were analyzed and perform the final phase of the stemming process. An average of 92.46 percent was obtained using the test data and stemming technique.","PeriodicalId":142482,"journal":{"name":"2022 IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET)","volume":"64 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Feature-based Stochastic Morphological Analyzer for Filipino Affixed Words\",\"authors\":\"G. A. Ong, Melvin A. Ballera\",\"doi\":\"10.1109/IICAIET55139.2022.9936850\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper papers presents a featured-based stochastic stemming methods for obtaining affixes in Filipino language. The method aims to introduce a statistical stemming approach that is based on the morphological attributes of Filipino words. Various Filipino word forms from different types of sources were obtained and test for affix removal system. The stemmer initially performs lexicon check from the created lexis which is comprises of common based words and various categorical language forms. Feature examinations are executed to check the data entry's structure. These includes affix removal, word assimilation, partial duplication, derivational words, and inflectional words. A KSTEM assimilatory method from Hybrid Stemming Algorithm are utilized to support derivational and inflectional conditions. From the created stochastic featured-based template algorithm, the entries were analyzed and perform the final phase of the stemming process. An average of 92.46 percent was obtained using the test data and stemming technique.\",\"PeriodicalId\":142482,\"journal\":{\"name\":\"2022 IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET)\",\"volume\":\"64 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-09-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IICAIET55139.2022.9936850\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IICAIET55139.2022.9936850","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Feature-based Stochastic Morphological Analyzer for Filipino Affixed Words
This paper papers presents a featured-based stochastic stemming methods for obtaining affixes in Filipino language. The method aims to introduce a statistical stemming approach that is based on the morphological attributes of Filipino words. Various Filipino word forms from different types of sources were obtained and test for affix removal system. The stemmer initially performs lexicon check from the created lexis which is comprises of common based words and various categorical language forms. Feature examinations are executed to check the data entry's structure. These includes affix removal, word assimilation, partial duplication, derivational words, and inflectional words. A KSTEM assimilatory method from Hybrid Stemming Algorithm are utilized to support derivational and inflectional conditions. From the created stochastic featured-based template algorithm, the entries were analyzed and perform the final phase of the stemming process. An average of 92.46 percent was obtained using the test data and stemming technique.