基于统计接近准则的无声语音识别任务脑命令字典优化

Q4 Computer Science
Alexandra Bernadotte, Alexandr D. Mazurin
{"title":"基于统计接近准则的无声语音识别任务脑命令字典优化","authors":"Alexandra Bernadotte, Alexandr D. Mazurin","doi":"10.20537/2076-7633-2023-15-3-675-690","DOIUrl":null,"url":null,"abstract":"In our research, we focus on the problem of classification for silent speech recognition to develop a brain– computer interface (BCI) based on electroencephalographic (EEG) data, which will be capable of assisting people with mental and physical disabilities and expanding human capabilities in everyday life. Our previous research has shown that the silent pronouncing of some words results in almost identical distributions of electroencephalographic signal data. Such a phenomenon has a suppressive impact on the quality of neural network model behavior. This paper proposes a data processing technique that distinguishes between statistically remote and inseparable classes in the dataset. Applying the proposed approach helps us reach the goal of maximizing the semantic load of the dictionary used in BCI. Furthermore, we propose the existence of a statistical predictive criterion for the accuracy of binary classification of the words in a dictionary. Such a criterion aims to estimate the lower and the upper bounds of classifiers’ behavior only by measuring quantitative statistical properties of the data (in particular, using the Kolmogorov– Smirnov method). We show that higher levels of classification accuracy can be achieved by means of applying the proposed predictive criterion, making it possible to form an optimized dictionary in terms of semantic load for the EEG-based BCIs. Furthermore, using such a dictionary as a training dataset for classification problems grants the statistical remoteness of the classes by taking into account the semantic and phonetic properties of the corresponding words and improves the classification behavior of silent speech recognition models.","PeriodicalId":37429,"journal":{"name":"Computer Research and Modeling","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Optimization of the brain command dictionary based on the statistical proximity criterion in silent speech recognition task\",\"authors\":\"Alexandra Bernadotte, Alexandr D. Mazurin\",\"doi\":\"10.20537/2076-7633-2023-15-3-675-690\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In our research, we focus on the problem of classification for silent speech recognition to develop a brain– computer interface (BCI) based on electroencephalographic (EEG) data, which will be capable of assisting people with mental and physical disabilities and expanding human capabilities in everyday life. Our previous research has shown that the silent pronouncing of some words results in almost identical distributions of electroencephalographic signal data. Such a phenomenon has a suppressive impact on the quality of neural network model behavior. This paper proposes a data processing technique that distinguishes between statistically remote and inseparable classes in the dataset. Applying the proposed approach helps us reach the goal of maximizing the semantic load of the dictionary used in BCI. Furthermore, we propose the existence of a statistical predictive criterion for the accuracy of binary classification of the words in a dictionary. Such a criterion aims to estimate the lower and the upper bounds of classifiers’ behavior only by measuring quantitative statistical properties of the data (in particular, using the Kolmogorov– Smirnov method). We show that higher levels of classification accuracy can be achieved by means of applying the proposed predictive criterion, making it possible to form an optimized dictionary in terms of semantic load for the EEG-based BCIs. Furthermore, using such a dictionary as a training dataset for classification problems grants the statistical remoteness of the classes by taking into account the semantic and phonetic properties of the corresponding words and improves the classification behavior of silent speech recognition models.\",\"PeriodicalId\":37429,\"journal\":{\"name\":\"Computer Research and Modeling\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer Research and Modeling\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.20537/2076-7633-2023-15-3-675-690\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"Computer Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Research and Modeling","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.20537/2076-7633-2023-15-3-675-690","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 0

摘要

本研究主要针对无声语音识别的分类问题,开发一种基于脑电图(EEG)数据的脑机接口(BCI),为身心残障人士提供辅助,拓展人类日常生活能力。我们之前的研究表明,一些单词的不发音会导致脑电图信号数据几乎相同的分布。这种现象对神经网络模型的行为质量有抑制作用。本文提出了一种数据处理技术,用于区分数据集中统计上的远程类和不可分割类。应用所提出的方法可以帮助我们达到最大化脑机接口中使用的字典的语义负载的目标。此外,我们提出了字典中词的二分类准确性的统计预测准则的存在性。该准则旨在仅通过测量数据的定量统计特性(特别是使用Kolmogorov - Smirnov方法)来估计分类器行为的下界和上界。我们表明,通过应用提出的预测标准可以实现更高水平的分类精度,从而可以根据基于脑电图的脑机接口的语义负载形成优化的字典。此外,使用这样的字典作为分类问题的训练数据集,通过考虑相应单词的语义和语音属性,赋予类的统计距离性,并改善无声语音识别模型的分类行为。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Optimization of the brain command dictionary based on the statistical proximity criterion in silent speech recognition task
In our research, we focus on the problem of classification for silent speech recognition to develop a brain– computer interface (BCI) based on electroencephalographic (EEG) data, which will be capable of assisting people with mental and physical disabilities and expanding human capabilities in everyday life. Our previous research has shown that the silent pronouncing of some words results in almost identical distributions of electroencephalographic signal data. Such a phenomenon has a suppressive impact on the quality of neural network model behavior. This paper proposes a data processing technique that distinguishes between statistically remote and inseparable classes in the dataset. Applying the proposed approach helps us reach the goal of maximizing the semantic load of the dictionary used in BCI. Furthermore, we propose the existence of a statistical predictive criterion for the accuracy of binary classification of the words in a dictionary. Such a criterion aims to estimate the lower and the upper bounds of classifiers’ behavior only by measuring quantitative statistical properties of the data (in particular, using the Kolmogorov– Smirnov method). We show that higher levels of classification accuracy can be achieved by means of applying the proposed predictive criterion, making it possible to form an optimized dictionary in terms of semantic load for the EEG-based BCIs. Furthermore, using such a dictionary as a training dataset for classification problems grants the statistical remoteness of the classes by taking into account the semantic and phonetic properties of the corresponding words and improves the classification behavior of silent speech recognition models.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Computer Research and Modeling
Computer Research and Modeling Computer Science-Computational Theory and Mathematics
CiteScore
0.80
自引率
0.00%
发文量
82
审稿时长
15 weeks
期刊介绍: The journal publishes original research papers and review articles in the field of computer research and mathematical modeling in physics, engineering, biology, ecology, economics, psychology etc. The journal covers research on computer methods and simulation of systems of various nature in the leading scientific schools of Russia and other countries. Of particular interest are papers devoted to simulation in thriving fields of science such as nanotechnology, bioinformatics, and econophysics. The main goal of the journal is to cover the development of computer and mathematical methods for the study of processes in complex structured and developing systems. The primary criterion for publication of papers in the journal is their scientific level. The journal does not charge a publication fee. The decision made on publication is based on the results of an independent review. The journal is oriented towards a wide readership – specialists in mathematical modeling in various areas of science and engineering. The scope of the journal includes: — mathematical modeling and numerical simulation; — numerical methods and the basics of their application; — models in physics and technology; — analysis and modeling of complex living systems; — models of economic and social systems. New sections and headings may be included in the next volumes.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信