Hardware Architecture of Emotion Recognition from Speech Features using Recurrent Neural Network and Backpropagation Through Time

2019 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS) Pub Date : 2019-12-01 DOI:10.1109/ISPACS48206.2019.8986342

Joshua Gunawan, Teresia R. S. Putri, Yashael F. Arthanto, T. Adiono

引用次数: 0

Abstract

Emotion recognition from speech feature is one of the application where the system needs temporal information in order to produce a correct prediction. On the other hand, recurrent neural network has the advantage of retaining temporal information. This paper proposed a hardware architecture design for emotion recognition system using LSTM (Long Short Term Memory) and BPTT (Backpropagation Through Time). For this application, the proposed architecture consists of a two-layer stacked LSTM with 53 cells on the first layer and 8 cells on the second layer. The architecture is implemented and verified using Verilog language and FPGA.

查看原文本刊更多论文

基于递归神经网络和时间反向传播的语音特征情感识别硬件架构

基于语音特征的情感识别是系统需要时间信息才能做出正确预测的应用之一。另一方面，递归神经网络具有保留时间信息的优点。提出了一种基于LSTM(长短期记忆)和BPTT(时间反向传播)的情感识别系统的硬件架构设计。对于这个应用程序，提出的体系结构包括一个两层堆叠的LSTM，第一层有53个单元，第二层有8个单元。采用Verilog语言和FPGA对该体系结构进行了实现和验证。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2019 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)

自引率

0.00%

发文量