Hemalatha Alapati, C. Paolini, S. Chinara, M. Sarkar
{"title":"Small-Footprint Keyword Spotting for Controlling Smart Home Appliances Using TCN and CRNN Models","authors":"Hemalatha Alapati, C. Paolini, S. Chinara, M. Sarkar","doi":"10.4018/ijitn.299365","DOIUrl":null,"url":null,"abstract":"Smart homes feature automatic fire/smoke detection, voice-operated assets and appliances etc. More often, smart home appliances like lights, fans, etc., can be controlled through voice commands. Voice-operated devices like Alexa, Siri, and Google Assistant, are not new in the current age concerning voice command execution. However, working with these supports requires a global connection with the internet that costs time and bandwidth. Controlling home appliances need concise commands involving keywords on/off. Further, to operate the home appliances, bandwidth consumption for internet is not a wise idea. Through this paper, models based on Temporal Convolutional Networks (TCN) and Convolutional Recurrent Neural Networks (CRNN) have been studied for Keyword Spotting (KWS) by training models with keywords pronounced in different accents. The performance of these models is compared, and their ability to detect unknown words is studied. Finally, how these models are suitable for building Smart Home assistants to control home utilities with minimum bandwidth consumption is discussed.","PeriodicalId":120331,"journal":{"name":"Int. J. Interdiscip. Telecommun. Netw.","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. Interdiscip. Telecommun. Netw.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4018/ijitn.299365","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Smart homes feature automatic fire/smoke detection, voice-operated assets and appliances etc. More often, smart home appliances like lights, fans, etc., can be controlled through voice commands. Voice-operated devices like Alexa, Siri, and Google Assistant, are not new in the current age concerning voice command execution. However, working with these supports requires a global connection with the internet that costs time and bandwidth. Controlling home appliances need concise commands involving keywords on/off. Further, to operate the home appliances, bandwidth consumption for internet is not a wise idea. Through this paper, models based on Temporal Convolutional Networks (TCN) and Convolutional Recurrent Neural Networks (CRNN) have been studied for Keyword Spotting (KWS) by training models with keywords pronounced in different accents. The performance of these models is compared, and their ability to detect unknown words is studied. Finally, how these models are suitable for building Smart Home assistants to control home utilities with minimum bandwidth consumption is discussed.