Hacking stylometry with multiple voices: Imaginary writers can override authorial signal in Delta

IF 0.7 3区 文学 0 HUMANITIES, MULTIDISCIPLINARY
Daniil Skorinkin, Boris Orekhov
{"title":"Hacking stylometry with multiple voices: Imaginary writers can override authorial signal in Delta","authors":"Daniil Skorinkin, Boris Orekhov","doi":"10.1093/llc/fqad012","DOIUrl":null,"url":null,"abstract":"Abstract It is a basic assumption of stylometry that texts written by the same person show greater stylometric similarity even if published under multiple pennames. Statistical authorship attribution strongly relies on the ability of Burrows’s Delta and its variants to cluster one author together regardless of pseudonyms. At the same time, the very first computational discoveries by the founder of modern stylometry showed that a single author is capable of producing multiple voices (Burrows, 1987, Computation into Criticism: A Study of Jane Austen’s Novels and an Experiment in Method. Clarendon Press). We investigate two authors whose stylistically autonomous pennames seem to deceive Delta and override authorial signals: a Portuguese poet Fernando Pessoa and a French novelist Romain Gary. Pessoa managed to create at least three pennames (the author himself used the term ‘heteronym’) who exhibit all traits of individual human beings from the stylometric point of view. Gary’s alter ego Emile Ajar, who was an intentional literary mystification, also demonstrates traits of stylometric autonomy. At the same time, other pseudonyms used by Gary lack that autonomy completely. Our investigation shows that there appears to be a continuum between a purely formal use of a penname, which brings almost no distinction from the real name of an author, and a strong literary sub-personality such as those created by Pessoa.","PeriodicalId":45315,"journal":{"name":"Digital Scholarship in the Humanities","volume":"31 1","pages":"0"},"PeriodicalIF":0.7000,"publicationDate":"2023-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Scholarship in the Humanities","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/llc/fqad012","RegionNum":3,"RegionCategory":"文学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"HUMANITIES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 1

Abstract

Abstract It is a basic assumption of stylometry that texts written by the same person show greater stylometric similarity even if published under multiple pennames. Statistical authorship attribution strongly relies on the ability of Burrows’s Delta and its variants to cluster one author together regardless of pseudonyms. At the same time, the very first computational discoveries by the founder of modern stylometry showed that a single author is capable of producing multiple voices (Burrows, 1987, Computation into Criticism: A Study of Jane Austen’s Novels and an Experiment in Method. Clarendon Press). We investigate two authors whose stylistically autonomous pennames seem to deceive Delta and override authorial signals: a Portuguese poet Fernando Pessoa and a French novelist Romain Gary. Pessoa managed to create at least three pennames (the author himself used the term ‘heteronym’) who exhibit all traits of individual human beings from the stylometric point of view. Gary’s alter ego Emile Ajar, who was an intentional literary mystification, also demonstrates traits of stylometric autonomy. At the same time, other pseudonyms used by Gary lack that autonomy completely. Our investigation shows that there appears to be a continuum between a purely formal use of a penname, which brings almost no distinction from the real name of an author, and a strong literary sub-personality such as those created by Pessoa.
用多种声音破解文体学:想象中的作家可以在Delta中覆盖作者信号
文体学的一个基本假设是,同一个人所写的文章即使用多个笔名发表,也会表现出更大的文体学相似性。统计作者归属很大程度上依赖于Burrows 's Delta及其变体将一个作者聚在一起的能力,而不考虑笔名。与此同时,现代文体学创始人的第一个计算发现表明,单个作者能够产生多种声音(巴罗斯,1987,计算到批评:简·奥斯汀小说研究和方法实验)。克拉伦登出版社)。我们研究了两位作家,他们风格自主的笔名似乎欺骗了德尔塔,并掩盖了作者的信号:葡萄牙诗人费尔南多·佩索阿和法国小说家罗曼·加里。佩索阿设法创造了至少三个笔名(作者自己使用了“异名”这个词),从文体学的角度来看,这些笔名表现出了人类个体的所有特征。加里的另一个自我埃米尔·阿加尔(Emile Ajar)是一个有意的文学神秘化者,也表现出风格自主的特征。与此同时,加里使用的其他假名完全缺乏这种自主权。我们的调查表明,在笔名的纯粹正式使用(与作者的真实姓名几乎没有区别)和佩索阿所创造的那种强烈的文学个性之间,似乎存在一种连续性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
1.80
自引率
25.00%
发文量
78
期刊介绍: DSH or Digital Scholarship in the Humanities is an international, peer reviewed journal which publishes original contributions on all aspects of digital scholarship in the Humanities including, but not limited to, the field of what is currently called the Digital Humanities. Long and short papers report on theoretical, methodological, experimental, and applied research and include results of research projects, descriptions and evaluations of tools, techniques, and methodologies, and reports on work in progress. DSH also publishes reviews of books and resources. Digital Scholarship in the Humanities was previously known as Literary and Linguistic Computing.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信