A Novel Gender Classification Model based on Convolutional Neural Network through Handwritten Text and Numeral

Pakize ERDOĞMUŞ, Abdullah Talha KABAKUŞ, Enver KÜÇÜKKÜLAHLI, Büşra TAKGİL, Ezgi KARA TİMUÇİN
{"title":"A Novel Gender Classification Model based on Convolutional Neural Network through Handwritten Text and Numeral","authors":"Pakize ERDOĞMUŞ, Abdullah Talha KABAKUŞ, Enver KÜÇÜKKÜLAHLI, Büşra TAKGİL, Ezgi KARA TİMUÇİN","doi":"10.35377/saucis...1337649","DOIUrl":null,"url":null,"abstract":"Human handwriting is used to investigate human characteristics in various applications, including but not limited to biometric authentication, personality profiling, historical document analysis, and forensic investigations. Gender is one of the most distinguishing characteristics of human beings. From this point forth, we propose a novel end-to-end model based on Convolutional Neural Network (CNN) that automatically extracts features from a given handwritten sample, which contains both handwritten text and numerals unlike the related work that uses only handwritten text, and classifies its owner’s gender. In addition to proposing a novel model, we introduce a new dataset that consists of 530 gender-labeled Turkish handwritten samples since, to the best of our knowledge, there does not exist a public gender-labeled Turkish handwriting dataset. Following an exhaustive process of hyperparameter optimization, the proposed CNN featured the most optimal hyperparameters and was both trained and evaluated on this dataset. According to the experimental result, the proposed novel model obtained an accuracy as high as 74.46%, which overperformed the state-of-the-art baselines and is promising on such a task that even humans could not have achieved highly-accurate results for, as of yet.","PeriodicalId":498230,"journal":{"name":"Sakarya university journal of computer and information sciences","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Sakarya university journal of computer and information sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.35377/saucis...1337649","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Human handwriting is used to investigate human characteristics in various applications, including but not limited to biometric authentication, personality profiling, historical document analysis, and forensic investigations. Gender is one of the most distinguishing characteristics of human beings. From this point forth, we propose a novel end-to-end model based on Convolutional Neural Network (CNN) that automatically extracts features from a given handwritten sample, which contains both handwritten text and numerals unlike the related work that uses only handwritten text, and classifies its owner’s gender. In addition to proposing a novel model, we introduce a new dataset that consists of 530 gender-labeled Turkish handwritten samples since, to the best of our knowledge, there does not exist a public gender-labeled Turkish handwriting dataset. Following an exhaustive process of hyperparameter optimization, the proposed CNN featured the most optimal hyperparameters and was both trained and evaluated on this dataset. According to the experimental result, the proposed novel model obtained an accuracy as high as 74.46%, which overperformed the state-of-the-art baselines and is promising on such a task that even humans could not have achieved highly-accurate results for, as of yet.
基于卷积神经网络的手写体文本和数字性别分类模型
人类笔迹在各种应用中用于调查人类特征,包括但不限于生物识别认证、个性分析、历史文档分析和法医调查。性别是人类最显著的特征之一。基于此,我们提出了一种基于卷积神经网络(CNN)的端到端模型,该模型可以自动从给定的包含手写文本和数字的手写样本中提取特征,并对其所有者的性别进行分类,这与仅使用手写文本的相关工作不同。除了提出一个新的模型外,我们还引入了一个由530个性别标记的土耳其手写样本组成的新数据集,因为据我们所知,不存在公开的性别标记的土耳其手写数据集。经过详尽的超参数优化过程,提出的CNN具有最优的超参数,并在该数据集上进行训练和评估。根据实验结果,所提出的新模型的准确率高达74.46%,超过了最先进的基线,在这样一个即使是人类也无法获得高精度结果的任务上是有希望的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信