Vijaya Nirmala Mitnala, M. Reed, Ian Kegel, J. Bicknell
2022 5th International Conference on Communications, Signal Processing, and their Applications (ICCSPA). Published 2022-12-27. DOI: 10.1109/ICCSPA55860.2022.10018978 (https://doi.org/10.1109/ICCSPA55860.2022.10018978)
Seamless device handover for pervasive speech communication
Sustained growth in the smart speaker market has helped establish high-quality, far-field speech communication as a viable alternative to the handset. Seamless handover offers a simple but effective way of improving the far-field communication experience by automatically switching to the best available device wherever the user is located. While the basic concept of seamless handover has been proven in a lab environment, this paper proposes two significant enhancements: reduced media disruption during handover, achieved by running a parallel session on multiple devices through Session Initiation Protocol (SIP) call forking; and coherence-based signal processing to determine more accurately the most suitable device for the user. The proposed solution uses the magnitude-squared coherence (MSC), and results verified on simulated and real datasets show that it performs excellently. However, the raw MSC is found to vary widely because of room effects; this work therefore shows that a smoothing predictor is needed to significantly reduce the extraneous transitions, which would otherwise make for a subjectively poor experience. Unlike a purely location-based approach, the proposed solution selects the best smart device without any environment-specific calibration, making it ideal for straightforward deployment of a pervasive speech application that uses smart speakers.
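
The effect of smoothing on device selection can be illustrated with a minimal Python sketch. It assumes each device already reports one score per frame (for example a band-averaged MSC value) where a higher score means a more suitable device. The exponential moving average, the value of `alpha`, the device names and the score values are all illustrative assumptions; they are not the predictor or the data used in the paper.

```python
from typing import Dict, List, Sequence


def select_raw(scores: Dict[str, Sequence[float]]) -> List[str]:
    """Pick the device with the highest raw score in every frame."""
    devices = list(scores)
    n_frames = len(scores[devices[0]])
    return [max(devices, key=lambda d: scores[d][t]) for t in range(n_frames)]


def select_smoothed(scores: Dict[str, Sequence[float]], alpha: float = 0.2) -> List[str]:
    """Pick the device with the highest exponentially smoothed score.

    Assumption: a first-order exponential moving average stands in for the
    smoothing predictor; alpha (0 < alpha <= 1) is a hypothetical setting.
    """
    devices = list(scores)
    n_frames = len(scores[devices[0]])
    state = {d: scores[d][0] for d in devices}  # start from the first frame
    picks = []
    for t in range(n_frames):
        for d in devices:
            state[d] = alpha * scores[d][t] + (1 - alpha) * state[d]
        picks.append(max(devices, key=lambda d: state[d]))
    return picks


def count_transitions(picks: Sequence[str]) -> int:
    """Count how often the selected device changes between adjacent frames."""
    return sum(1 for a, b in zip(picks, picks[1:]) if a != b)


# Hypothetical scores: the user starts near "kitchen" (noisy score), then
# moves towards "lounge" halfway through.
example_scores = {
    "kitchen": [0.8, 0.4] * 5 + [0.5, 0.1] * 5,
    "lounge": [0.5] * 10 + [0.9] * 10,
}
raw_picks = select_raw(example_scores)
smooth_picks = select_smoothed(example_scores)
print(count_transitions(raw_picks), count_transitions(smooth_picks))  # prints "9 1"
```

With these made-up scores, selecting on the raw values flips between devices on almost every frame while the user is still near the first device, whereas the smoothed selection hands over once, shortly after the user moves. A smaller `alpha` suppresses more spurious transitions at the cost of a later handover.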