基于高性能动态asic的音频信号特征提取

2016 International Conference on Advanced Computing and Applications (ACOMP) Pub Date : 2016-11-01 DOI:10.1109/ACOMP.2016.025

Tam Chi Nguyen, L. Pham, H. M. Nguyen, Bao Bui, D. Ngo, Trang Hoang

{"title":"基于高性能动态asic的音频信号特征提取","authors":"Tam Chi Nguyen, L. Pham, H. M. Nguyen, Bao Bui, D. Ngo, Trang Hoang","doi":"10.1109/ACOMP.2016.025","DOIUrl":null,"url":null,"abstract":"State-of-the-art speech recognition, speech analysis as well as music modeling have approached Mel-Frequency Cepstral Coefficient (MFCC) and confirmed great performance in comparison to other feature extractions. Based on obtained software performance, a wide range of hardware designs are applied to highly increasing integrated systems achieving real-time performance and ability of mobility. Nevertheless, most hardware approaches witnessing certain configurations have experienced limitation of functions due to fixed-point format, strict silicon requirements or exact applications, which is reasonable for low ability of reusing and high cost of product. As regards MFCC method, there are various concerning parameters such as number of samples, range of filter bands, Fast Fourier Transform (FFT) number, number of cepstrums or even different level of delta coefficients, which significantly affect final performance of entire applications. As a result, a dynamic ASIC-based MFCC hardware architecture is proposed in this paper in order to meet real-time system requiring high performance as well as confirm superiorities regarding to ability of reconfiguration feasibly through an Advance High-performance Bus (AHB) interface in chip level instead of modifying at Register Transfer Level (RTL) in developed duration. Besides, have not only experiments on 130nm technology with full ASIC design flow witnessed high frequency at 500MHz but applying IEEE 754 floating-point format has also confirmed great accuracy between hardware design and software design, which apply in certain application towards Vietnam automatic speech recognition (ASR).","PeriodicalId":133451,"journal":{"name":"2016 International Conference on Advanced Computing and Applications (ACOMP)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"A High Performance Dynamic ASIC-Based Audio Signal Feature Extraction (MFCC)\",\"authors\":\"Tam Chi Nguyen, L. Pham, H. M. Nguyen, Bao Bui, D. Ngo, Trang Hoang\",\"doi\":\"10.1109/ACOMP.2016.025\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"State-of-the-art speech recognition, speech analysis as well as music modeling have approached Mel-Frequency Cepstral Coefficient (MFCC) and confirmed great performance in comparison to other feature extractions. Based on obtained software performance, a wide range of hardware designs are applied to highly increasing integrated systems achieving real-time performance and ability of mobility. Nevertheless, most hardware approaches witnessing certain configurations have experienced limitation of functions due to fixed-point format, strict silicon requirements or exact applications, which is reasonable for low ability of reusing and high cost of product. As regards MFCC method, there are various concerning parameters such as number of samples, range of filter bands, Fast Fourier Transform (FFT) number, number of cepstrums or even different level of delta coefficients, which significantly affect final performance of entire applications. As a result, a dynamic ASIC-based MFCC hardware architecture is proposed in this paper in order to meet real-time system requiring high performance as well as confirm superiorities regarding to ability of reconfiguration feasibly through an Advance High-performance Bus (AHB) interface in chip level instead of modifying at Register Transfer Level (RTL) in developed duration. Besides, have not only experiments on 130nm technology with full ASIC design flow witnessed high frequency at 500MHz but applying IEEE 754 floating-point format has also confirmed great accuracy between hardware design and software design, which apply in certain application towards Vietnam automatic speech recognition (ASR).\",\"PeriodicalId\":133451,\"journal\":{\"name\":\"2016 International Conference on Advanced Computing and Applications (ACOMP)\",\"volume\":\"46 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 International Conference on Advanced Computing and Applications (ACOMP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ACOMP.2016.025\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Conference on Advanced Computing and Applications (ACOMP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACOMP.2016.025","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

最先进的语音识别，语音分析以及音乐建模已经接近Mel-Frequency Cepstral系数(MFCC)，并且与其他特征提取相比证实了出色的性能。基于获得的软件性能，广泛的硬件设计被应用于高度增长的集成系统，以实现实时性能和移动能力。然而，由于定点格式、严格的硅要求或精确的应用，大多数见证某些配置的硬件方法都经历了功能限制，这是可重用性低和产品成本高的合理原因。对于MFCC方法，涉及到的参数有很多，比如采样个数、滤波器频带范围、快速傅里叶变换(Fast Fourier Transform, FFT)个数、倒频谱个数甚至是不同水平的δ系数，这些都会对整个应用的最终性能产生很大的影响。为此，本文提出了一种基于动态asic的MFCC硬件体系结构，以满足实时系统对高性能的要求，并确认其优势，即通过芯片级的高级高性能总线(advanced performance Bus, AHB)接口，而不是在开发期间在寄存器传输级(Register Transfer level, RTL)进行修改，可以实现可重构。此外，在130nm工艺下进行了500MHz高频全ASIC设计流程的实验，并采用IEEE 754浮点格式，验证了硬件设计和软件设计的准确性，在越南自动语音识别(ASR)中有一定的应用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A High Performance Dynamic ASIC-Based Audio Signal Feature Extraction (MFCC)

State-of-the-art speech recognition, speech analysis as well as music modeling have approached Mel-Frequency Cepstral Coefficient (MFCC) and confirmed great performance in comparison to other feature extractions. Based on obtained software performance, a wide range of hardware designs are applied to highly increasing integrated systems achieving real-time performance and ability of mobility. Nevertheless, most hardware approaches witnessing certain configurations have experienced limitation of functions due to fixed-point format, strict silicon requirements or exact applications, which is reasonable for low ability of reusing and high cost of product. As regards MFCC method, there are various concerning parameters such as number of samples, range of filter bands, Fast Fourier Transform (FFT) number, number of cepstrums or even different level of delta coefficients, which significantly affect final performance of entire applications. As a result, a dynamic ASIC-based MFCC hardware architecture is proposed in this paper in order to meet real-time system requiring high performance as well as confirm superiorities regarding to ability of reconfiguration feasibly through an Advance High-performance Bus (AHB) interface in chip level instead of modifying at Register Transfer Level (RTL) in developed duration. Besides, have not only experiments on 130nm technology with full ASIC design flow witnessed high frequency at 500MHz but applying IEEE 754 floating-point format has also confirmed great accuracy between hardware design and software design, which apply in certain application towards Vietnam automatic speech recognition (ASR).

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2016 International Conference on Advanced Computing and Applications (ACOMP)

自引率

0.00%

发文量