Digits Recognition with Quadrant Photodiode and Convolutional Neural Network

Kamil Janczyk, Krzysztof Czuszyński, J. Rumiński
{"title":"Digits Recognition with Quadrant Photodiode and Convolutional Neural Network","authors":"Kamil Janczyk, Krzysztof Czuszyński, J. Rumiński","doi":"10.1109/HSI.2018.8431246","DOIUrl":null,"url":null,"abstract":"In this paper we have investigated the capabilities of a quadrant photodiode based gesture sensor in the recognition of digits drawn in the air. The sensor consisting of 4 active elements, 4 LEDs and a pinhole was considered as input interface for both discrete and continuous gestures. Index finger and a round pointer were used as navigating mediums for the sensor. Experiments performed with 5 volunteers allowed to record 300 examples of each digit from 0 to 9, which were drawn in the air. Digits were converted from a list of recorded coordinates into images processed as in the MNIST database. Three approaches for recognition of digits recorded by quadrant photodiode were considered: convolutional neural network trained only on examples from the MNIST database, network trained on mixed data of MNIST with examples recorded using quadrant photodiode (4/1 proportions) and trained on the MNIST with examples recorded using the elaborated sensor but after the arbitral rejection of 20% of worst quality data (4/1 proportions preserved). The application of the third approach in comparison to the first one allowed to increase the overall accuracy of digits classification from 34.4% to 86% for testing data recorded with the use of the pointer and from 32% to 81.2% for data recorded with the use of a finger (for 50Hz sampling frequency).","PeriodicalId":441117,"journal":{"name":"2018 11th International Conference on Human System Interaction (HSI)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 11th International Conference on Human System Interaction (HSI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HSI.2018.8431246","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In this paper we have investigated the capabilities of a quadrant photodiode based gesture sensor in the recognition of digits drawn in the air. The sensor consisting of 4 active elements, 4 LEDs and a pinhole was considered as input interface for both discrete and continuous gestures. Index finger and a round pointer were used as navigating mediums for the sensor. Experiments performed with 5 volunteers allowed to record 300 examples of each digit from 0 to 9, which were drawn in the air. Digits were converted from a list of recorded coordinates into images processed as in the MNIST database. Three approaches for recognition of digits recorded by quadrant photodiode were considered: convolutional neural network trained only on examples from the MNIST database, network trained on mixed data of MNIST with examples recorded using quadrant photodiode (4/1 proportions) and trained on the MNIST with examples recorded using the elaborated sensor but after the arbitral rejection of 20% of worst quality data (4/1 proportions preserved). The application of the third approach in comparison to the first one allowed to increase the overall accuracy of digits classification from 34.4% to 86% for testing data recorded with the use of the pointer and from 32% to 81.2% for data recorded with the use of a finger (for 50Hz sampling frequency).
基于象限光电二极管和卷积神经网络的数字识别
在本文中,我们研究了基于四象限光电二极管的手势传感器在识别空中绘制的数字中的能力。传感器由4个有源元件、4个led和一个针孔组成,作为离散和连续手势的输入接口。用食指和圆形指针作为传感器的导航媒介。在实验中,5名志愿者被允许记录300个从0到9的数字,这些数字都是在空中画的。数字从记录的坐标列表转换为在MNIST数据库中处理的图像。考虑了三种识别象限光电二极管记录的数字的方法:卷积神经网络仅在MNIST数据库的样本上训练,网络在MNIST的混合数据上训练,使用象限光电二极管(4/1比例)记录的样本,以及在MNIST上训练使用精细传感器记录的样本,但经过仲裁拒绝20%最差质量数据(保留4/1比例)。与第一种方法相比,第三种方法的应用允许将使用指针记录的测试数据的数字分类的总体准确性从34.4%提高到86%,使用手指记录的数据从32%提高到81.2% (50Hz采样频率)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信