{"title":"An automatic speaker-speech recognition system for friendly HMI based on binary halved clustering","authors":"Chih-Hsiang Peng, Chih-Hung Chou, Ta-Wen Kuan, Po-Chuan Lin, Jhing-Fa Wang, P. Yu","doi":"10.1109/ICOT.2014.6956624","DOIUrl":null,"url":null,"abstract":"This work presents a low-cost and fast-trainable automatic speaker-speech recognition (ASSR) system, by proposed binary halved clustering (BHC) method for human-machine interface (HMI) on an embedded platform, owing to the trait of low cost in ASSR system is essential and affordable for real-world application. In addition, fast-trainable ability can provide fast responding time. The reduction of waiting time makes the proposed HMI to be friendly for users. The speech recognition uses enhanced cross-word reference templates (ECWRTs) for template training type. The novel BHC method uses binary-halved splitting to generate speaker models for low complexity requirement. The regularity of binary halved behavior is beneficial for data scheduling and resource sharing in the embedded ASSR system. Compared with the conventional works, simulation results indicate that the proposed hardware accelerator achieves 28% less cost, 90% less responding time, an ASSR accuracy of 90%. Comparison exhibits that performance of the proposed system is greater than the conventional works, thereby demonstrating the friendly and affordable factor of the proposed HMI.","PeriodicalId":343641,"journal":{"name":"2014 International Conference on Orange Technologies","volume":"BME-26 10","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 International Conference on Orange Technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICOT.2014.6956624","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
This work presents a low-cost and fast-trainable automatic speaker-speech recognition (ASSR) system, by proposed binary halved clustering (BHC) method for human-machine interface (HMI) on an embedded platform, owing to the trait of low cost in ASSR system is essential and affordable for real-world application. In addition, fast-trainable ability can provide fast responding time. The reduction of waiting time makes the proposed HMI to be friendly for users. The speech recognition uses enhanced cross-word reference templates (ECWRTs) for template training type. The novel BHC method uses binary-halved splitting to generate speaker models for low complexity requirement. The regularity of binary halved behavior is beneficial for data scheduling and resource sharing in the embedded ASSR system. Compared with the conventional works, simulation results indicate that the proposed hardware accelerator achieves 28% less cost, 90% less responding time, an ASSR accuracy of 90%. Comparison exhibits that performance of the proposed system is greater than the conventional works, thereby demonstrating the friendly and affordable factor of the proposed HMI.