Speech imagery brain-computer interfaces: a systematic literature review.

Journal of neural engineering Pub Date : 2025-06-26 DOI:10.1088/1741-2552/ade28e

A Tates, A Matran-Fernandez, S Halder, I Daly

{"title":"Speech imagery brain-computer interfaces: a systematic literature review.","authors":"A Tates, A Matran-Fernandez, S Halder, I Daly","doi":"10.1088/1741-2552/ade28e","DOIUrl":null,"url":null,"abstract":"Objective:Speech Imagery (SI) refers to the mental experience of hearing speech and may be the core of verbal thinking for people who undergo internal monologues. It belongs to the set of possible mental imagery states that produce kinesthetic experiences whose sensations are similar to their non-imagery counterparts. SI underpins language processes and may have similar building blocks to overt speech without the final articulatory outcome. The kinesthetic experience of SI has been proposed to be a projection of the expected articulatory outcome in a top-down processing manner. As SI seems to be a core human cognitive task it has been proposed as a paradigm for Brain-Computer Interfaces (BCI). One important aspect of BCI designs is usability, and SI may present an intuitive paradigm, which has brought the attention of researchers to attempt to decode SI from brain signals. In this paper we review the important aspects of SI-BCI decoding pipelines.Approach. We conducted this review according to the Preferred Reporting Items for Systematic reviews and Meta-Analysis guidelines. Specifically, we filtered peer-reviewed reports via a search of Google Scholar and PubMed. We selected a total of 104 reports that attempted to decode SI from neural activity.Main results. Our review reveals a growing interest in SI decoding in the last 20 years, and shows how different neuroimaging modalities have been employed to record SI in distinct ways to instruct participants to perform this task. We discuss the signal processing methods used along with feature extraction techniques and found a high preference for Deep Learning models. We have summarized and compared the decoding attempts by quantifying the efficacy of decoding by measuring Information Transfer Rates. Notably, fewer than 6% of studies reported real-time decoding, with the vast majority focused on offline analyses. This suggests existing challenges of this paradigm, as the variety of approaches and outcomes prevents a clear identification of the field's current state-of-the-art. We offer a discussion of future research directions.SignificanceSI is an attractive BCI paradigm. This review outlines the increasing interest in SI, the methodological trends, the efficacy of different approaches, and the current progress toward real-time decoding systems.","PeriodicalId":94096,"journal":{"name":"Journal of neural engineering","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of neural engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1088/1741-2552/ade28e","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Objective:Speech Imagery (SI) refers to the mental experience of hearing speech and may be the core of verbal thinking for people who undergo internal monologues. It belongs to the set of possible mental imagery states that produce kinesthetic experiences whose sensations are similar to their non-imagery counterparts. SI underpins language processes and may have similar building blocks to overt speech without the final articulatory outcome. The kinesthetic experience of SI has been proposed to be a projection of the expected articulatory outcome in a top-down processing manner. As SI seems to be a core human cognitive task it has been proposed as a paradigm for Brain-Computer Interfaces (BCI). One important aspect of BCI designs is usability, and SI may present an intuitive paradigm, which has brought the attention of researchers to attempt to decode SI from brain signals. In this paper we review the important aspects of SI-BCI decoding pipelines.Approach. We conducted this review according to the Preferred Reporting Items for Systematic reviews and Meta-Analysis guidelines. Specifically, we filtered peer-reviewed reports via a search of Google Scholar and PubMed. We selected a total of 104 reports that attempted to decode SI from neural activity.Main results. Our review reveals a growing interest in SI decoding in the last 20 years, and shows how different neuroimaging modalities have been employed to record SI in distinct ways to instruct participants to perform this task. We discuss the signal processing methods used along with feature extraction techniques and found a high preference for Deep Learning models. We have summarized and compared the decoding attempts by quantifying the efficacy of decoding by measuring Information Transfer Rates. Notably, fewer than 6% of studies reported real-time decoding, with the vast majority focused on offline analyses. This suggests existing challenges of this paradigm, as the variety of approaches and outcomes prevents a clear identification of the field's current state-of-the-art. We offer a discussion of future research directions.SignificanceSI is an attractive BCI paradigm. This review outlines the increasing interest in SI, the methodological trends, the efficacy of different approaches, and the current progress toward real-time decoding systems.

查看原文本刊更多论文

语音图像脑机接口：系统文献综述。

言语意象（Speech Imagery， SI）是指听到言语的心理体验，可能是经历内心独白的人言语思维的核心。它属于一组可能的心理意象状态，这些状态产生的动觉体验与其非意象对应的感觉相似。SI是语言过程的基础，可能与没有最终发音结果的显性语言有类似的构建模块。科学探究的动觉体验被认为是以自上而下的加工方式对预期发音结果的投射。由于SI似乎是一项核心的人类认知任务，它已被提出作为脑机接口（BCI）的范式。脑机接口设计的一个重要方面是可用性，而SI可能呈现出一种直观的范式，这引起了研究人员试图从大脑信号中解码SI的注意。本文综述了SI-BCI解码管道的重要方面。\textit{方法}。我们根据系统评价和荟萃分析的首选报告项目（PRISMA）指南进行了本综述。具体来说，我们通过搜索谷歌Scholar和PubMed来过滤同行评议的报告。我们总共选择了104份试图从神经活动中解码语音图像的报告。\textit{主要结果}。我们的回顾揭示了在过去的20年里，人们对SI解码的兴趣越来越大，并展示了不同的神经成像模式是如何以不同的方式记录SI来指导参与者执行这项任务的。我们讨论了与特征提取技术一起使用的信号处理方法，并发现深度学习模型具有很高的偏好。我们通过测量信息传输速率来量化解码的有效性，总结和比较了解码的尝试。值得注意的是，只有不到6％的研究报告了实时解码，绝大多数研究集中在离线分析上。这表明了该范式存在的挑战，因为各种方法和结果阻碍了对该领域当前最先进技术的清晰识别。并对未来的研究方向进行了讨论。\textit{意义}言语意象是一种极具吸引力的脑机接口范式。这篇综述概述了对实时解码系统日益增长的兴趣、方法趋势、不同方法的功效以及当前的进展。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of neural engineering

自引率

0.00%

发文量