{"title":"Disentangling semantic and prosodic features of English poetry","authors":"Wenyi Shang, Ted Underwood","doi":"10.1093/llc/fqae008","DOIUrl":null,"url":null,"abstract":"The distinction between genre and form is still contested in literary studies. While scholars associated with the New Formalism are criticized for perceiving everything as a form, digital humanists tend to argue that everything is a genre. In this research, we employed machine learning models to classify 36,635 English poems in the Chadwyck-Healey Literature Collections into twenty-seven categories, focusing on their semantic features (lexicons) and prosodic features (meters and rhymes) independently. Our findings reveal that different categories of poetry are distinguished by different groups of characteristics, without a clear-cut division between those driven predominantly by semantic features and those driven predominantly by prosodic features. Instead, poetry categories manifest a combination of semantic and prosodic elements, spanning a spectrum of different strengths in both domains. These findings suggest that the colloquial distinction between “genre” and “form” is based on real differences between poetic categories, although those differences may not be quite as crisply binary as the vocabulary implies.","PeriodicalId":45315,"journal":{"name":"Digital Scholarship in the Humanities","volume":"48 1","pages":""},"PeriodicalIF":0.7000,"publicationDate":"2024-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Scholarship in the Humanities","FirstCategoryId":"98","ListUrlMain":"https://doi.org/10.1093/llc/fqae008","RegionNum":3,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"HUMANITIES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
The distinction between genre and form is still contested in literary studies. While scholars associated with the New Formalism are criticized for perceiving everything as a form, digital humanists tend to argue that everything is a genre. In this research, we employed machine learning models to classify 36,635 English poems in the Chadwyck-Healey Literature Collections into twenty-seven categories, focusing on their semantic features (lexicons) and prosodic features (meters and rhymes) independently. Our findings reveal that different categories of poetry are distinguished by different groups of characteristics, without a clear-cut division between those driven predominantly by semantic features and those driven predominantly by prosodic features. Instead, poetry categories manifest a combination of semantic and prosodic elements, spanning a spectrum of different strengths in both domains. These findings suggest that the colloquial distinction between “genre” and “form” is based on real differences between poetic categories, although those differences may not be quite as crisply binary as the vocabulary implies.
期刊介绍:
DSH or Digital Scholarship in the Humanities is an international, peer reviewed journal which publishes original contributions on all aspects of digital scholarship in the Humanities including, but not limited to, the field of what is currently called the Digital Humanities. Long and short papers report on theoretical, methodological, experimental, and applied research and include results of research projects, descriptions and evaluations of tools, techniques, and methodologies, and reports on work in progress. DSH also publishes reviews of books and resources. Digital Scholarship in the Humanities was previously known as Literary and Linguistic Computing.