作者归属使用委员会机器与k近邻评级投票

A. Kuşakcı
{"title":"作者归属使用委员会机器与k近邻评级投票","authors":"A. Kuşakcı","doi":"10.1109/NEUREL.2012.6419997","DOIUrl":null,"url":null,"abstract":"Authorship attribution, namely determination of the author of a text, may become an extraordinarily complex and sensitive job due to its relatively difficult feature extraction phase and highly nonlinear nature. This paper proposes a classification tool using committee machines consisting of multilayered perceptron neural networks (MLP) to identify the author of a text. Each expert is an individual MLP learning complex input-output relation composed of 14 lexical, stylometric attributes extracted from the corpus. The resulting mapping after training is used to identify the texts in German language written by two different authors. Unlike other committee based classification tools individual answers of the experts are combined with a novel voting method, k-nearest neighbors rated voting. The proposed method shows very promising results when benchmarked with simple majority voting technique.","PeriodicalId":343718,"journal":{"name":"11th Symposium on Neural Network Applications in Electrical Engineering","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"Authorship attribution using committee machines with k-nearest neighbors rated voting\",\"authors\":\"A. Kuşakcı\",\"doi\":\"10.1109/NEUREL.2012.6419997\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Authorship attribution, namely determination of the author of a text, may become an extraordinarily complex and sensitive job due to its relatively difficult feature extraction phase and highly nonlinear nature. This paper proposes a classification tool using committee machines consisting of multilayered perceptron neural networks (MLP) to identify the author of a text. Each expert is an individual MLP learning complex input-output relation composed of 14 lexical, stylometric attributes extracted from the corpus. The resulting mapping after training is used to identify the texts in German language written by two different authors. Unlike other committee based classification tools individual answers of the experts are combined with a novel voting method, k-nearest neighbors rated voting. The proposed method shows very promising results when benchmarked with simple majority voting technique.\",\"PeriodicalId\":343718,\"journal\":{\"name\":\"11th Symposium on Neural Network Applications in Electrical Engineering\",\"volume\":\"24 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"11th Symposium on Neural Network Applications in Electrical Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NEUREL.2012.6419997\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"11th Symposium on Neural Network Applications in Electrical Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NEUREL.2012.6419997","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8

摘要

作者归属,即文本作者的确定,由于其相对困难的特征提取阶段和高度非线性的性质,可能成为一项极其复杂和敏感的工作。本文提出了一种使用多层感知器神经网络(MLP)组成的委员会机来识别文本作者的分类工具。每个专家都是一个独立的MLP,学习由从语料库中提取的14个词汇、风格属性组成的复杂输入输出关系。训练后的结果映射用于识别由两个不同作者写的德语文本。与其他基于委员会的分类工具不同,专家的个人答案与一种新颖的投票方法相结合,即k近邻评级投票。该方法与简单多数投票技术进行了比较,结果非常理想。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Authorship attribution using committee machines with k-nearest neighbors rated voting
Authorship attribution, namely determination of the author of a text, may become an extraordinarily complex and sensitive job due to its relatively difficult feature extraction phase and highly nonlinear nature. This paper proposes a classification tool using committee machines consisting of multilayered perceptron neural networks (MLP) to identify the author of a text. Each expert is an individual MLP learning complex input-output relation composed of 14 lexical, stylometric attributes extracted from the corpus. The resulting mapping after training is used to identify the texts in German language written by two different authors. Unlike other committee based classification tools individual answers of the experts are combined with a novel voting method, k-nearest neighbors rated voting. The proposed method shows very promising results when benchmarked with simple majority voting technique.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信