{"title":"基于多接受CNN的异步脑机接口通信和神经康复想象语音检测。","authors":"Byung-Kwan Ko;Seo-Hyun Lee;Seong-Whan Lee","doi":"10.1109/TNSRE.2025.3592312","DOIUrl":null,"url":null,"abstract":"Imagined speech-based brain-computer interface (BCI) facilitates brain signal-driven intuitive communication which holds great promise as an effective speech rehabilitation tool, enabling real-time, hands-free interaction for individuals with speech and motor impairments. While speech-based assistant systems rely on wake-word detection (e.g., “Hey Siri”), BCI-based communication system must capture imagined onset from EEG signals to turn on the ‘brain switch’ to further convey user’s imagined command. Nevertheless, the absence of reliable ground truth for the endogenous paradigm adds to the complexity to train the model to capture exact onset from continuous EEG. To address these issues, we introduce a multi-receptive field convolutional neural network, designed to capture speech and idle states based on behaviorally-aligned EEG features. We propose a voice-based ground truth alignment method with voting strategy that aims to synchronize imagined speech with overt speech onset and offset, providing a structured approach for capturing speech events in asynchronous BCI systems. Furthermore, spectral and phonological analyses revealed that beta and alpha bands, as well as syllable count, appear to influence speech state discriminability. Evaluations on imagined and overt speech tasks, including pseudo-online experiments, demonstrate the potential to enhance asynchronous BCI systems, supporting real-time communication for both healthy and impaired individuals.","PeriodicalId":13419,"journal":{"name":"IEEE Transactions on Neural Systems and Rehabilitation Engineering","volume":"33 ","pages":"2904-2914"},"PeriodicalIF":5.2000,"publicationDate":"2025-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11095808","citationCount":"0","resultStr":"{\"title\":\"Imagined Speech Detection Using Multi-Receptive CNN for Asynchronous BCI Communication and Neurorehabilitation\",\"authors\":\"Byung-Kwan Ko;Seo-Hyun Lee;Seong-Whan Lee\",\"doi\":\"10.1109/TNSRE.2025.3592312\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Imagined speech-based brain-computer interface (BCI) facilitates brain signal-driven intuitive communication which holds great promise as an effective speech rehabilitation tool, enabling real-time, hands-free interaction for individuals with speech and motor impairments. While speech-based assistant systems rely on wake-word detection (e.g., “Hey Siri”), BCI-based communication system must capture imagined onset from EEG signals to turn on the ‘brain switch’ to further convey user’s imagined command. Nevertheless, the absence of reliable ground truth for the endogenous paradigm adds to the complexity to train the model to capture exact onset from continuous EEG. To address these issues, we introduce a multi-receptive field convolutional neural network, designed to capture speech and idle states based on behaviorally-aligned EEG features. We propose a voice-based ground truth alignment method with voting strategy that aims to synchronize imagined speech with overt speech onset and offset, providing a structured approach for capturing speech events in asynchronous BCI systems. Furthermore, spectral and phonological analyses revealed that beta and alpha bands, as well as syllable count, appear to influence speech state discriminability. Evaluations on imagined and overt speech tasks, including pseudo-online experiments, demonstrate the potential to enhance asynchronous BCI systems, supporting real-time communication for both healthy and impaired individuals.\",\"PeriodicalId\":13419,\"journal\":{\"name\":\"IEEE Transactions on Neural Systems and Rehabilitation Engineering\",\"volume\":\"33 \",\"pages\":\"2904-2914\"},\"PeriodicalIF\":5.2000,\"publicationDate\":\"2025-07-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11095808\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Neural Systems and Rehabilitation Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11095808/\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Neural Systems and Rehabilitation Engineering","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/11095808/","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
Imagined Speech Detection Using Multi-Receptive CNN for Asynchronous BCI Communication and Neurorehabilitation
Imagined speech-based brain-computer interface (BCI) facilitates brain signal-driven intuitive communication which holds great promise as an effective speech rehabilitation tool, enabling real-time, hands-free interaction for individuals with speech and motor impairments. While speech-based assistant systems rely on wake-word detection (e.g., “Hey Siri”), BCI-based communication system must capture imagined onset from EEG signals to turn on the ‘brain switch’ to further convey user’s imagined command. Nevertheless, the absence of reliable ground truth for the endogenous paradigm adds to the complexity to train the model to capture exact onset from continuous EEG. To address these issues, we introduce a multi-receptive field convolutional neural network, designed to capture speech and idle states based on behaviorally-aligned EEG features. We propose a voice-based ground truth alignment method with voting strategy that aims to synchronize imagined speech with overt speech onset and offset, providing a structured approach for capturing speech events in asynchronous BCI systems. Furthermore, spectral and phonological analyses revealed that beta and alpha bands, as well as syllable count, appear to influence speech state discriminability. Evaluations on imagined and overt speech tasks, including pseudo-online experiments, demonstrate the potential to enhance asynchronous BCI systems, supporting real-time communication for both healthy and impaired individuals.
期刊介绍:
Rehabilitative and neural aspects of biomedical engineering, including functional electrical stimulation, acoustic dynamics, human performance measurement and analysis, nerve stimulation, electromyography, motor control and stimulation; and hardware and software applications for rehabilitation engineering and assistive devices.