{"title":"Reconstruction of speech spectrogram based on non-invasive EEG signal","authors":"Di Zhou, M. Unoki, Gaoyan Zhang, J. Dang","doi":"10.1109/ISCSLP57327.2022.10038234","DOIUrl":null,"url":null,"abstract":"Decoding neural activity into speech could enable natural conversations for people who are unable to communicate as a result of neurological diseases. Studies have proven that speech could be directly recognized or synthesized from intracranial recordings. However, intracranial electrocorticography is invasive, thus not comfortable for patients. By the acoustic representation of speech in the high-level brain cortex, we successfully reconstructed a speech spectrogram from non-invasive electroencephalography (EEG), which has similar accuracy to previous intracranial recording. As well as the reported superior temporal gyrus, premotor cortex, and inferior frontal gyrus, we also found speech representations in several other cortices such as an entorhinal, fusiform, and temporal pole. The intelligibility of the recovered speech in this study was not high enough, however, our findings show a possibility to reconstruct speech from non-invasive EEG in the future.","PeriodicalId":246698,"journal":{"name":"2022 13th International Symposium on Chinese Spoken Language Processing (ISCSLP)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 13th International Symposium on Chinese Spoken Language Processing (ISCSLP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCSLP57327.2022.10038234","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Decoding neural activity into speech could enable natural conversations for people who are unable to communicate as a result of neurological diseases. Studies have proven that speech could be directly recognized or synthesized from intracranial recordings. However, intracranial electrocorticography is invasive, thus not comfortable for patients. By the acoustic representation of speech in the high-level brain cortex, we successfully reconstructed a speech spectrogram from non-invasive electroencephalography (EEG), which has similar accuracy to previous intracranial recording. As well as the reported superior temporal gyrus, premotor cortex, and inferior frontal gyrus, we also found speech representations in several other cortices such as an entorhinal, fusiform, and temporal pole. The intelligibility of the recovered speech in this study was not high enough, however, our findings show a possibility to reconstruct speech from non-invasive EEG in the future.