Authors: Sarah Gul, Musarrat Azher, S. Nawaz
Journal: Linguistics and Literature Review
Published: 2021-10-29
DOI: 10.32350/llr.72/04 (https://doi.org/10.32350/llr.72/04)
Development of Saraiki WordNet by Mapping of Word Senses: A Corpus-based Approach
This paper aimed to develop the Saraiki WordNet. Saraiki is a regional language of Pakistan with a distinct history of its own, and it is remarkably similar to two other languages, Punjabi and Sindhi. It has several dialects, each representative of the region where it is spoken. The Urdu WordNet (Zafar, Mahmood, Shams & Hussain, 2014), which was created at UET Lahore and is based on the Princeton WordNet (Miller, 1990), served as the basis for the Saraiki WordNet. Data were extracted from dictionaries (lughats), from literary sources such as poetry and fiction, and from non-literary sources such as Saraiki-language newspapers. Urdu word senses were then mapped onto Saraiki word senses, following the expansion approach. This study may aid in creating bilingual dictionaries, such as Saraiki-Urdu dictionaries, in the future.
Keywords: expand approach, mapping, Saraiki language, WordNet
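
The mapping step described in the abstract can be illustrated with a minimal sketch. It assumes the common reading of the expansion approach: the synsets and relations of an existing wordnet are kept, and only the lemmas are replaced with target-language equivalents drawn from a bilingual lexicon. The `Synset` class, the `expand_wordnet` function, the synset identifiers, and the `ur:`/`skr:` placeholder entries are all hypothetical and are not taken from the paper or from the Urdu WordNet.

```python
from dataclasses import dataclass, field


@dataclass
class Synset:
    """A simplified synset: an id, its lemmas, a gloss, and hypernym ids."""
    synset_id: str
    lemmas: list[str]
    gloss: str
    hypernyms: list[str] = field(default_factory=list)


def expand_wordnet(source_synsets, lexicon):
    """Map source synsets to target synsets by translating their lemmas.

    Synset ids and hypernym links are copied unchanged, so the target
    wordnet inherits the structure of the source wordnet.
    Returns (mapped synsets by id, ids that had no translation).
    """
    mapped, unmapped = {}, []
    for syn in source_synsets:
        target_lemmas = []
        for lemma in syn.lemmas:
            for candidate in lexicon.get(lemma, []):
                if candidate not in target_lemmas:  # skip duplicates
                    target_lemmas.append(candidate)
        if target_lemmas:
            # The gloss is carried over untranslated in this sketch.
            mapped[syn.synset_id] = Synset(
                syn.synset_id, target_lemmas, syn.gloss, list(syn.hypernyms)
            )
        else:
            unmapped.append(syn.synset_id)  # left for manual review
    return mapped, unmapped


# Placeholder data: the "ur:" and "skr:" strings stand in for real words.
urdu_synsets = [
    Synset("n-001", ["ur:liquid"], "a fluid substance"),
    Synset("n-002", ["ur:water", "ur:aab"], "a clear drinkable liquid", ["n-001"]),
    Synset("n-003", ["ur:river"], "a large natural stream"),
]
urdu_to_saraiki = {
    "ur:liquid": ["skr:liquid"],
    "ur:water": ["skr:water"],
    "ur:aab": ["skr:water"],
}

mapped, unmapped = expand_wordnet(urdu_synsets, urdu_to_saraiki)
for synset_id, syn in mapped.items():
    print(synset_id, syn.lemmas, syn.hypernyms)
print("unmapped:", unmapped)
```

Synsets with no lexicon entry are returned separately rather than dropped, since in practice such gaps would need to be filled from other sources, such as the dictionaries and corpora the abstract mentions.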