{"title":"Inter country poetry classification using Topic modeling","authors":"K. P. Kumar, T. Padmaja","doi":"10.1109/ICAITPR51569.2022.9844213","DOIUrl":null,"url":null,"abstract":"Poetry is an art of arranging the carefully picked words in a specific order to express the authors experience and emotions. Poetry in India has its strong roots with world famous and excellent poets, amongst few renowned poets are “Universal Poet” Rabindranath Tagore, “Nightingale of India” Sarojini Naidu and “Swami” Vivekananda. Poetry style varies from country to country depending on the author. Author’s poetry topics, words and style depends on the circumstances they raised in, situations they faced, and their mind set. Many authors who belongs to India but settled in western countries and written their poems, in this context automatically identifying a poem’s author is a challenging task for the literary scholars who analyze the poetry. In this work authors proposed a method based on Latent Dirichlet Allocation(LDA) topic modeling to classify the poetry written by Indian or western(American)poet based on the distribution of topics per document. The experiment is performed on 3 data sets 128, 1600 and author wise poems respectively. This experiment is performed based on semantic features. Best result 91% precision and 88% accuracy is achieved on author wise poems data set using random forest algorithm","PeriodicalId":262409,"journal":{"name":"2022 First International Conference on Artificial Intelligence Trends and Pattern Recognition (ICAITPR)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 First International Conference on Artificial Intelligence Trends and Pattern Recognition (ICAITPR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAITPR51569.2022.9844213","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Poetry is an art of arranging the carefully picked words in a specific order to express the authors experience and emotions. Poetry in India has its strong roots with world famous and excellent poets, amongst few renowned poets are “Universal Poet” Rabindranath Tagore, “Nightingale of India” Sarojini Naidu and “Swami” Vivekananda. Poetry style varies from country to country depending on the author. Author’s poetry topics, words and style depends on the circumstances they raised in, situations they faced, and their mind set. Many authors who belongs to India but settled in western countries and written their poems, in this context automatically identifying a poem’s author is a challenging task for the literary scholars who analyze the poetry. In this work authors proposed a method based on Latent Dirichlet Allocation(LDA) topic modeling to classify the poetry written by Indian or western(American)poet based on the distribution of topics per document. The experiment is performed on 3 data sets 128, 1600 and author wise poems respectively. This experiment is performed based on semantic features. Best result 91% precision and 88% accuracy is achieved on author wise poems data set using random forest algorithm