Synthesizing Speech by Decoding Intracortical Neural Activity from Dorsal Motor Cortex
M. Wairagkar, L. Hochberg, D. Brandman, S. Stavisky
2023 11th International IEEE/EMBS Conference on Neural Engineering (NER), published 2023-04-24
DOI: 10.1109/NER52421.2023.10123880
Citations: 5
Abstract
Losing the ability to speak due to brain injury or neurodegenerative diseases such as ALS can be debilitating. Brain-computer interfaces (BCIs) could provide affected individuals with a fast and intuitive way to communicate by decoding speech-related neural activity into a computer-synthesized voice. Current intracortical communication BCIs, which decode attempted handwriting or point-and-click typing, are substantially slower than natural speech and do not capture its full expressive range. Recent studies have identified speech features in ECoG and sEEG recordings; however, intelligible speech synthesis has not yet been demonstrated. Our previous work has shown speech-related patterns in intracortical recordings from dorsal (arm/hand) motor cortex that enabled discrete word/phoneme classification, motivating an intracortical approach to continuous voice synthesis. Here, we present a neural decoding framework that synthesizes speech by directly translating neural activity, recorded from human motor cortex with intracortical multielectrode arrays, into a low-dimensional speech feature space from which voice is synthesized.
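To make the described pipeline concrete, below is a minimal illustrative sketch in PyTorch of the kind of decoder such a framework implies: a recurrent network mapping binned intracortical activity to a low-dimensional speech feature trajectory. This is not the authors' implementation; the channel count (96), bin width (20 ms), feature dimensionality (10), and GRU architecture are all assumptions for illustration, and the vocoder that would render audio from the decoded features is not shown.

```python
# Illustrative sketch only -- NOT the paper's actual decoder.
# Assumes binned multiunit spike counts (hypothetical 20 ms bins) from a
# hypothetical 96-channel intracortical array, decoded into a hypothetical
# 10-dimensional speech feature vector per bin; a separate vocoder
# (not shown) would synthesize voice from those features.
import torch
import torch.nn as nn

class SpeechFeatureDecoder(nn.Module):
    """Recurrent decoder: neural time series -> low-dim speech features."""
    def __init__(self, n_channels=96, n_hidden=256, n_speech_features=10):
        super().__init__()
        self.rnn = nn.GRU(n_channels, n_hidden, num_layers=2, batch_first=True)
        self.readout = nn.Linear(n_hidden, n_speech_features)

    def forward(self, spikes):      # spikes: (batch, time, channels)
        h, _ = self.rnn(spikes)     # h: (batch, time, hidden)
        return self.readout(h)      # (batch, time, speech features)

# Toy usage: one 2-second trial of 20 ms bins (100 bins, 96 channels).
decoder = SpeechFeatureDecoder()
spikes = torch.poisson(torch.full((1, 100, 96), 2.0))  # fake spike counts
speech_features = decoder(spikes)
print(speech_features.shape)  # torch.Size([1, 100, 10])
```

Decoding into a compact feature space rather than a raw waveform keeps the regression target low-dimensional and smooth, which is the design motivation the abstract points to; the specific feature set the authors use is not stated here.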