Zixin Liu, Ling Lei, Xinyu Huang, Xinyu Li, Hongiian Liu
{"title":"Design and Realization of Dialect Interaction System Based on VAD","authors":"Zixin Liu, Ling Lei, Xinyu Huang, Xinyu Li, Hongiian Liu","doi":"10.1109/ICCST53801.2021.00026","DOIUrl":null,"url":null,"abstract":"In view of the miscommunication problem between Mandarin users and dialect users, we design a portable dialect interaction system based on the raspberry pi. We use IFLYTEK for recognition and synthesis, Turing Robot for intention understanding and response. Recognition results and interaction effect can be more accurate by using voice activity detection (VAD) to preprocess speech signal, which removes noise and separates speech signals by judging the starting and ending points of speech. The strong scalability and good hardware adaptability of raspberry pi make the system have good practicability.","PeriodicalId":222463,"journal":{"name":"2021 International Conference on Culture-oriented Science & Technology (ICCST)","volume":"48 8","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Culture-oriented Science & Technology (ICCST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCST53801.2021.00026","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In view of the miscommunication problem between Mandarin users and dialect users, we design a portable dialect interaction system based on the raspberry pi. We use IFLYTEK for recognition and synthesis, Turing Robot for intention understanding and response. Recognition results and interaction effect can be more accurate by using voice activity detection (VAD) to preprocess speech signal, which removes noise and separates speech signals by judging the starting and ending points of speech. The strong scalability and good hardware adaptability of raspberry pi make the system have good practicability.