A Computation Resource Friendly Convolutional Neural Network Engine For EEG-based Emotion Recognition
Yi-Ju Zhan, M. Vai, S. Barma, S. Pun, Jia Wen Li, P. Mak
2019 IEEE International Conference on Computational Intelligence and Virtual Environments for Measurement Systems and Applications (CIVEMSA), June 2019
DOI: 10.1109/CIVEMSA45640.2019.9071594
Citations: 6
Abstract
EEG-based emotion recognition is a crucial link in Human-Computer Interaction (HCI) applications. Nowadays, Convolutional Neural Networks (CNNs) and related CNN-hybrid approaches achieve state-of-the-art accuracy in this field. However, most existing techniques employ large-scale neural networks, which create performance bottlenecks in portable systems. Moreover, traditional convolution kernels mix information across the multiple EEG frequency bands that are critical for investigating emotional states. To address these issues, we first extract power spectral features from four frequency bands (θ, α, β, γ) and transform the obtained features into cortex-like frames that preserve the spatial information of electrode positions, so that multi-channel, multi-band, time-series EEG signals can be represented efficiently. We then design a shallow depthwise parallel CNN, inspired by the MobileNet technique, to learn spatial representations from the labeled frames. Segment-level emotion recognition experiments on the DEAP database verify the proposed architecture. Our approach achieves competitive accuracies of 84.07% and 82.95% on arousal and valence, respectively. The experimental results also demonstrate the computational efficiency of the proposed method: compared with the state-of-the-art approach, it saves 69.23% of GPU memory and reduces peak GPU utilization by 30% with only a 6.5% drop in accuracy. Therefore, our method shows broad application prospects for EEG-based emotion recognition on resource-limited devices.
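The abstract only outlines the pipeline, so the following is a minimal sketch of the two ideas it names: per-band power spectral features scattered onto a cortex-like 2-D frame, and a shallow CNN built from MobileNet-style depthwise-separable convolutions. The band definitions, 9×9 grid size, layer widths, and all function names (`band_powers`, `to_frames`, `ShallowEmotionCNN`) are illustrative assumptions, not the paper's exact configuration.

```python
# Hypothetical sketch of the two stages described in the abstract (assumptions noted above).
import numpy as np
from scipy.signal import welch
import torch
import torch.nn as nn

BANDS = {"theta": (4, 8), "alpha": (8, 13), "beta": (13, 30), "gamma": (30, 45)}

def band_powers(segment, fs=128):
    """segment: (n_channels, n_samples) EEG -> (n_bands, n_channels) power features."""
    freqs, psd = welch(segment, fs=fs, nperseg=fs)       # per-channel power spectral density
    feats = []
    for lo, hi in BANDS.values():
        mask = (freqs >= lo) & (freqs < hi)
        feats.append(psd[:, mask].mean(axis=1))          # mean power within the band
    return np.stack(feats)                               # (4, n_channels)

def to_frames(band_feats, grid_positions, grid_size=9):
    """Scatter per-channel band powers onto a grid_size x grid_size cortex-like frame."""
    frames = np.zeros((len(BANDS), grid_size, grid_size), dtype=np.float32)
    for ch, (r, c) in enumerate(grid_positions):         # electrode index -> (row, col) on the grid
        frames[:, r, c] = band_feats[:, ch]
    return frames                                        # (4, 9, 9): one frame channel per band

class DepthwiseSeparableBlock(nn.Module):
    """Depthwise 3x3 conv followed by pointwise 1x1 conv, as in MobileNet."""
    def __init__(self, in_ch, out_ch):
        super().__init__()
        self.depthwise = nn.Conv2d(in_ch, in_ch, 3, padding=1, groups=in_ch)
        self.pointwise = nn.Conv2d(in_ch, out_ch, 1)
        self.act = nn.ReLU()
    def forward(self, x):
        return self.act(self.pointwise(self.depthwise(x)))

class ShallowEmotionCNN(nn.Module):
    """Shallow classifier over (4, 9, 9) band-power frames; 2 classes (e.g., low/high arousal)."""
    def __init__(self, n_classes=2):
        super().__init__()
        self.features = nn.Sequential(
            DepthwiseSeparableBlock(4, 32),
            DepthwiseSeparableBlock(32, 64),
            nn.AdaptiveAvgPool2d(1),
        )
        self.classifier = nn.Linear(64, n_classes)
    def forward(self, x):
        return self.classifier(self.features(x).flatten(1))
```

Replacing standard 3×3 convolutions with depthwise-separable blocks is what keeps the parameter count and memory footprint small, which is the property the paper emphasizes for resource-limited devices.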