A Wearable Ultra-Low-Power System for EEG-Based Speech-Imagery Interfaces

IF 4.9

IEEE transactions on biomedical circuits and systems Pub Date : 2025-03-23 DOI:10.1109/TBCAS.2025.3573027

Thorir Mar Ingolfsson;Victor Kartsch;Luca Benini;Andrea Cossettini

{"title":"A Wearable Ultra-Low-Power System for EEG-Based Speech-Imagery Interfaces","authors":"Thorir Mar Ingolfsson;Victor Kartsch;Luca Benini;Andrea Cossettini","doi":"10.1109/TBCAS.2025.3573027","DOIUrl":null,"url":null,"abstract":"Speech imagery—the process of mentally simulating speech without vocalization—is a promising approach for brain-computer interfaces (BCIs), enabling assistive communication for individuals with speech impairments or to enhance privacy. However, existing EEG-based speech imagery systems remain impractical for use outside specialized laboratories due to their reliance on high-channel-count and resource-intensive machine learning models running on external computing platforms. In this work, we present the first end-to-end demonstration of EEG-based speech imagery decoding on a low-channel, ultra-low-power wearable device. Building on our previous work on vowel imagery, we introduce an extended framework leveraging the BioGAP platform and <sc>VowelNet</small>, a lightweight neural network optimized for embedded speech imagery classification. In particular, we demonstrate state-of-the-art accuracy in the classification of an expanded vocabulary comprising vowels, commands, and rest states (13 classes) with a subject-specific training approach, achieving up to 50.0% for one subject (42.8% average) in multi-class classification. We deploy our model on an embedded biosignal acquisition and processing platform (BioGAP), based on the GAP9 processor, for real-time inference with minimal power consumption (25.93 mW). Our system achieves continuous execution for more than 21 hours on a small LiPo battery while maintaining classification latencies of 40.9 ms. Finally, we also explore the benefits of applying Continual Learning techniques to progressively improve the system’s performance throughout its operational lifetime, and we demonstrate that electrodes located on the temporal area contribute the most to the overall accuracy. This work marks a significant step toward practical, real-time, and unobtrusive speech imagery BCIs, unlocking new opportunities for covert communication and assistive technologies.","PeriodicalId":94031,"journal":{"name":"IEEE transactions on biomedical circuits and systems","volume":"19 4","pages":"743-755"},"PeriodicalIF":4.9000,"publicationDate":"2025-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on biomedical circuits and systems","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11012702/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Speech imagery—the process of mentally simulating speech without vocalization—is a promising approach for brain-computer interfaces (BCIs), enabling assistive communication for individuals with speech impairments or to enhance privacy. However, existing EEG-based speech imagery systems remain impractical for use outside specialized laboratories due to their reliance on high-channel-count and resource-intensive machine learning models running on external computing platforms. In this work, we present the first end-to-end demonstration of EEG-based speech imagery decoding on a low-channel, ultra-low-power wearable device. Building on our previous work on vowel imagery, we introduce an extended framework leveraging the BioGAP platform and VowelNet, a lightweight neural network optimized for embedded speech imagery classification. In particular, we demonstrate state-of-the-art accuracy in the classification of an expanded vocabulary comprising vowels, commands, and rest states (13 classes) with a subject-specific training approach, achieving up to 50.0% for one subject (42.8% average) in multi-class classification. We deploy our model on an embedded biosignal acquisition and processing platform (BioGAP), based on the GAP9 processor, for real-time inference with minimal power consumption (25.93 mW). Our system achieves continuous execution for more than 21 hours on a small LiPo battery while maintaining classification latencies of 40.9 ms. Finally, we also explore the benefits of applying Continual Learning techniques to progressively improve the system’s performance throughout its operational lifetime, and we demonstrate that electrodes located on the temporal area contribute the most to the overall accuracy. This work marks a significant step toward practical, real-time, and unobtrusive speech imagery BCIs, unlocking new opportunities for covert communication and assistive technologies.

查看原文本刊更多论文

基于脑电图语音图像接口的可穿戴超低功耗系统。

语音图像——一种不发声的心理模拟语音的过程——是一种很有前途的脑机接口（bci）方法，可以为有语言障碍的个体提供辅助交流或增强隐私。然而，现有的基于脑电图的语音图像系统仍然不适合在专业实验室之外使用，因为它们依赖于运行在外部计算平台上的高通道计数和资源密集型机器学习模型。在这项工作中，我们在低信道、超低功耗可穿戴设备上首次展示了基于脑电图的语音图像解码的端到端演示。基于我们之前在元音图像方面的工作，我们引入了一个扩展框架，利用BioGAP平台和VOWELNET，这是一个针对嵌入式语音图像分类进行优化的轻量级神经网络。特别是，我们通过特定学科的训练方法，在包含元音、命令和休息状态（13个类）的扩展词汇的分类中展示了最先进的准确性，在多类分类中，一个学科的分类准确率高达50.0%（平均42.8%）。我们将我们的模型部署在基于GAP9处理器的嵌入式生物信号采集和处理平台（BioGAP）上，以最小的功耗（25.93 mW）进行实时推断。我们的系统在一个小的LiPo电池上实现了超过21小时的连续执行，同时保持了40.9 ms的分类延迟。最后，我们还探讨了应用持续学习技术在整个操作寿命期间逐步提高系统性能的好处，并且我们证明了位于颞区的电极对整体准确性的贡献最大。这项工作标志着向实用、实时和不显眼的语音图像bci迈出了重要的一步，为隐蔽通信和辅助技术打开了新的机会。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE transactions on biomedical circuits and systems

自引率

0.00%

发文量