{"title":"一个多层次的形态学和随机他加禄语词干提取模板","authors":"G. A. Ong, Melvin A. Ballera","doi":"10.1109/CCWC54503.2022.9720873","DOIUrl":null,"url":null,"abstract":"Tagalog is the basis of the Philippine language, that is widely spoken throughout the majority of Philippine regions. Currently, various Filipino language morphological investigations employs a language dependent methodology that is evidently efficient. However, due to its substantial morphological traits and emergent evolution, the native language is regarded to be morpho syntactically rich. This paper proposed a stochastic and multi-level morpho-tactical system for stemming the Filipino language that can cover limited frameworks. Numerous word forms were gathered and examined to create a stochastic template. It derives morphological weights from multi-layer systems, emphasizing the importance of language-dependent and language-independent approaches. Additionally, the study incorporates unusual and novel data such as slang words, borrowed words, street jargon in “Taglish,” and colloquial phrases used in formal and informal interactions, works, and literature.","PeriodicalId":101590,"journal":{"name":"2022 IEEE 12th Annual Computing and Communication Workshop and Conference (CCWC)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A Multi-level Morphological and Stochastic Tagalog Stemming Template\",\"authors\":\"G. A. Ong, Melvin A. Ballera\",\"doi\":\"10.1109/CCWC54503.2022.9720873\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Tagalog is the basis of the Philippine language, that is widely spoken throughout the majority of Philippine regions. Currently, various Filipino language morphological investigations employs a language dependent methodology that is evidently efficient. However, due to its substantial morphological traits and emergent evolution, the native language is regarded to be morpho syntactically rich. This paper proposed a stochastic and multi-level morpho-tactical system for stemming the Filipino language that can cover limited frameworks. Numerous word forms were gathered and examined to create a stochastic template. It derives morphological weights from multi-layer systems, emphasizing the importance of language-dependent and language-independent approaches. Additionally, the study incorporates unusual and novel data such as slang words, borrowed words, street jargon in “Taglish,” and colloquial phrases used in formal and informal interactions, works, and literature.\",\"PeriodicalId\":101590,\"journal\":{\"name\":\"2022 IEEE 12th Annual Computing and Communication Workshop and Conference (CCWC)\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-01-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 12th Annual Computing and Communication Workshop and Conference (CCWC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CCWC54503.2022.9720873\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 12th Annual Computing and Communication Workshop and Conference (CCWC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCWC54503.2022.9720873","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Multi-level Morphological and Stochastic Tagalog Stemming Template
Tagalog is the basis of the Philippine language, that is widely spoken throughout the majority of Philippine regions. Currently, various Filipino language morphological investigations employs a language dependent methodology that is evidently efficient. However, due to its substantial morphological traits and emergent evolution, the native language is regarded to be morpho syntactically rich. This paper proposed a stochastic and multi-level morpho-tactical system for stemming the Filipino language that can cover limited frameworks. Numerous word forms were gathered and examined to create a stochastic template. It derives morphological weights from multi-layer systems, emphasizing the importance of language-dependent and language-independent approaches. Additionally, the study incorporates unusual and novel data such as slang words, borrowed words, street jargon in “Taglish,” and colloquial phrases used in formal and informal interactions, works, and literature.