{"title":"乌尔都语的诗人归属:寻找短文本的最佳配置","authors":"M. A. Rao, Tafseer Ahmed","doi":"10.51153/kjcis.v4i2.58","DOIUrl":null,"url":null,"abstract":"\n \n \n \nThis study presents a machine learning system to identify the poet of a given poetic piece consisting of 2 lines (i.e. a couplet) or more. The task is more difficult than the general task of author attribution, as the number of words in verses and poems are usually less than the number of articles present in author attribution datasets. We applied classification algorithms with different sets of feature configurations to run several experiments and found that the system performs best when support vector machine using a combination of unigram and bigram are used . The best system (for 5 Urdu poets) has the accuracy of 88.7%. \n \n \n \n","PeriodicalId":299009,"journal":{"name":"KIET Journal of Computing and Information Sciences","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Poet Attribution for Urdu: Finding Optimal Configuration for Short Text\",\"authors\":\"M. A. Rao, Tafseer Ahmed\",\"doi\":\"10.51153/kjcis.v4i2.58\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n \\n \\n \\nThis study presents a machine learning system to identify the poet of a given poetic piece consisting of 2 lines (i.e. a couplet) or more. The task is more difficult than the general task of author attribution, as the number of words in verses and poems are usually less than the number of articles present in author attribution datasets. We applied classification algorithms with different sets of feature configurations to run several experiments and found that the system performs best when support vector machine using a combination of unigram and bigram are used . The best system (for 5 Urdu poets) has the accuracy of 88.7%. \\n \\n \\n \\n\",\"PeriodicalId\":299009,\"journal\":{\"name\":\"KIET Journal of Computing and Information Sciences\",\"volume\":\"33 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"KIET Journal of Computing and Information Sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.51153/kjcis.v4i2.58\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"KIET Journal of Computing and Information Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.51153/kjcis.v4i2.58","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Poet Attribution for Urdu: Finding Optimal Configuration for Short Text
This study presents a machine learning system to identify the poet of a given poetic piece consisting of 2 lines (i.e. a couplet) or more. The task is more difficult than the general task of author attribution, as the number of words in verses and poems are usually less than the number of articles present in author attribution datasets. We applied classification algorithms with different sets of feature configurations to run several experiments and found that the system performs best when support vector machine using a combination of unigram and bigram are used . The best system (for 5 Urdu poets) has the accuracy of 88.7%.