RED:基于智能边缘的扬声器系统与环境传感技术

2022 IEEE International Conference on Data Science and Information System (ICDSIS) Pub Date : 2022-07-29 DOI:10.1109/ICDSIS55133.2022.9915823

Tanvi K Jois, M. V. Bharadwaj, A. Mukhopadhyay

{"title":"RED:基于智能边缘的扬声器系统与环境传感技术","authors":"Tanvi K Jois, M. V. Bharadwaj, A. Mukhopadhyay","doi":"10.1109/ICDSIS55133.2022.9915823","DOIUrl":null,"url":null,"abstract":"Smart speakers have become widely common. A smart speaker allows people to do a wide range of tasks, including presenting information such as the time, date, and several other things. It can also stream songs on its own. Smart speakers that can carry out tasks are known as virtual assistants. In this work, a smart speaker RED is demonstrated. RED is particularly designed for audio sensing activities. Mel’s spectrograms are obtained and subjected to rudimentary neural networks to achieve automatic speech recognition in RED. RED can adjust to ambient noise, communicate with the user, and function as a virtual assistant by cracking jokes, playing/pausing music, and altering volume on command. Users can dynamically assign wake words of their desire in RED. RED was able to reach an overall accuracy of 91%.","PeriodicalId":178360,"journal":{"name":"2022 IEEE International Conference on Data Science and Information System (ICDSIS)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"RED: An Intelligent Edge based Speaker System with Ambient Sensing Technology\",\"authors\":\"Tanvi K Jois, M. V. Bharadwaj, A. Mukhopadhyay\",\"doi\":\"10.1109/ICDSIS55133.2022.9915823\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Smart speakers have become widely common. A smart speaker allows people to do a wide range of tasks, including presenting information such as the time, date, and several other things. It can also stream songs on its own. Smart speakers that can carry out tasks are known as virtual assistants. In this work, a smart speaker RED is demonstrated. RED is particularly designed for audio sensing activities. Mel’s spectrograms are obtained and subjected to rudimentary neural networks to achieve automatic speech recognition in RED. RED can adjust to ambient noise, communicate with the user, and function as a virtual assistant by cracking jokes, playing/pausing music, and altering volume on command. Users can dynamically assign wake words of their desire in RED. RED was able to reach an overall accuracy of 91%.\",\"PeriodicalId\":178360,\"journal\":{\"name\":\"2022 IEEE International Conference on Data Science and Information System (ICDSIS)\",\"volume\":\"25 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Conference on Data Science and Information System (ICDSIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDSIS55133.2022.9915823\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Data Science and Information System (ICDSIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDSIS55133.2022.9915823","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

智能音箱已经变得非常普遍。智能音箱可以让人们完成各种各样的任务，包括展示时间、日期等信息。它还可以自己播放歌曲。可以执行任务的智能扬声器被称为虚拟助手。在这项工作中，展示了一个智能扬声器RED。RED是专门为音频传感活动设计的。获得Mel的声谱图，并对其进行初步的神经网络处理，实现RED的自动语音识别。RED可以根据环境噪音进行调整，与用户进行交流，还可以作为虚拟助手讲笑话、播放/暂停音乐，并根据命令改变音量。用户可以在RED中动态地分配他们想要的唤醒词。RED能够达到91%的总体准确率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

RED: An Intelligent Edge based Speaker System with Ambient Sensing Technology

Smart speakers have become widely common. A smart speaker allows people to do a wide range of tasks, including presenting information such as the time, date, and several other things. It can also stream songs on its own. Smart speakers that can carry out tasks are known as virtual assistants. In this work, a smart speaker RED is demonstrated. RED is particularly designed for audio sensing activities. Mel’s spectrograms are obtained and subjected to rudimentary neural networks to achieve automatic speech recognition in RED. RED can adjust to ambient noise, communicate with the user, and function as a virtual assistant by cracking jokes, playing/pausing music, and altering volume on command. Users can dynamically assign wake words of their desire in RED. RED was able to reach an overall accuracy of 91%.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 IEEE International Conference on Data Science and Information System (ICDSIS)

自引率

0.00%

发文量