A Multi-Dictionary Approach to Abstractness/Concreteness-Based Authorship Attribution

Lubomir Ivanov
{"title":"A Multi-Dictionary Approach to Abstractness/Concreteness-Based Authorship Attribution","authors":"Lubomir Ivanov","doi":"10.32473/flairs.36.133262","DOIUrl":null,"url":null,"abstract":"\n \n \nWe present some early results from a research project aimed at exploring the usefulness of abstractness/concreteness as stylistic features for authorship attribution. We conjecture that authors use abstract/concrete words and phrases in suf- ficiently unique ways, so that machine learning classifiers can learn to distinguish the individual authors’ writing styles. Our approach is based on using the abstractness rat- ings of words and phrases from texts with established au- thorship to generate training vectors for different machine learning classifiers. The combined word/phrase ratings are extracted from two separate abstractness dictionaries – an approach that yields stronger results than using single ab- stractness dictionaries. The paper describes the details of our methodology and compares the results to those obtained using traditional authorship attribution stylistic features. The limitations of our current methodology and directions for further research are outlined at the end of the paper. \n \n \n","PeriodicalId":302103,"journal":{"name":"The International FLAIRS Conference Proceedings","volume":"102 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The International FLAIRS Conference Proceedings","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.32473/flairs.36.133262","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

We present some early results from a research project aimed at exploring the usefulness of abstractness/concreteness as stylistic features for authorship attribution. We conjecture that authors use abstract/concrete words and phrases in suf- ficiently unique ways, so that machine learning classifiers can learn to distinguish the individual authors’ writing styles. Our approach is based on using the abstractness rat- ings of words and phrases from texts with established au- thorship to generate training vectors for different machine learning classifiers. The combined word/phrase ratings are extracted from two separate abstractness dictionaries – an approach that yields stronger results than using single ab- stractness dictionaries. The paper describes the details of our methodology and compares the results to those obtained using traditional authorship attribution stylistic features. The limitations of our current methodology and directions for further research are outlined at the end of the paper.
基于抽象性/具体性的作者归属多词典方法
我们介绍了一个研究项目的一些早期结果,该项目旨在探索抽象/具体作为作者归属的风格特征的有用性。我们推测作者以足够独特的方式使用抽象/具体的单词和短语,以便机器学习分类器可以学习区分单个作者的写作风格。我们的方法是基于使用文本中的单词和短语的抽象性评级,并建立权威来为不同的机器学习分类器生成训练向量。组合的单词/短语评级是从两个独立的抽象字典中提取的——这种方法比使用单个抽象字典产生更强的结果。本文详细介绍了我们的方法,并将结果与使用传统作者归属文体特征获得的结果进行了比较。本文最后概述了当前研究方法的局限性和进一步研究的方向。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信