A Multi-modal Virtual-Real Fusion System for Multi-task Human-Computer Interaction

2022 IEEE International Conference on Networking, Sensing and Control (ICNSC) Pub Date : 2022-12-15 DOI:10.1109/ICNSC55942.2022.10004097

Xin-Li Zhang, Jiahui Yu, Yuxiang Sun, Min Li, Yang Song, Xianzhong Zhou

{"title":"A Multi-modal Virtual-Real Fusion System for Multi-task Human-Computer Interaction","authors":"Xin-Li Zhang, Jiahui Yu, Yuxiang Sun, Min Li, Yang Song, Xianzhong Zhou","doi":"10.1109/ICNSC55942.2022.10004097","DOIUrl":null,"url":null,"abstract":"Due to the complexity of the task and the diversification of the scene, the traditional interactive control method can no longer meet the requirements of users. To solve this problem, a multi-modal virtual-real fusion system for multi-task human-computer interaction is proposed in this paper, which integrates eye movement, gesture and voice. Frist, aiming at the phenomenon of multi-task and multi-modal, a task-modal matching model is established. Then, the task-modal matching model is abstracted into a multi-objective optimization problem, and a method for solving this problem is designed and a matching scheme is successfully obtained. Meanwhile, the construction of the system is completed for the virtual-real fusion environment, and the control of unmanned car and virtual car is realized. The system can carry out multi-modal interaction and complete multiple tasks in real scene, virtual scene, parallel system and virtual-real fusion scene. Finally, the experiment proves the stability and reliability of the system.","PeriodicalId":230499,"journal":{"name":"2022 IEEE International Conference on Networking, Sensing and Control (ICNSC)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Networking, Sensing and Control (ICNSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICNSC55942.2022.10004097","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Due to the complexity of the task and the diversification of the scene, the traditional interactive control method can no longer meet the requirements of users. To solve this problem, a multi-modal virtual-real fusion system for multi-task human-computer interaction is proposed in this paper, which integrates eye movement, gesture and voice. Frist, aiming at the phenomenon of multi-task and multi-modal, a task-modal matching model is established. Then, the task-modal matching model is abstracted into a multi-objective optimization problem, and a method for solving this problem is designed and a matching scheme is successfully obtained. Meanwhile, the construction of the system is completed for the virtual-real fusion environment, and the control of unmanned car and virtual car is realized. The system can carry out multi-modal interaction and complete multiple tasks in real scene, virtual scene, parallel system and virtual-real fusion scene. Finally, the experiment proves the stability and reliability of the system.

查看原文本刊更多论文

面向多任务人机交互的多模态虚实融合系统

由于任务的复杂性和场景的多样化，传统的交互式控制方法已经不能满足用户的要求。为解决这一问题，本文提出了一种融合眼动、手势和语音的多模态虚实融合多任务人机交互系统。首先，针对多任务、多模态现象，建立了任务-模态匹配模型。然后，将任务-模态匹配模型抽象为多目标优化问题，设计了求解该问题的方法，成功地获得了匹配方案。同时，完成了虚实融合环境下的系统构建，实现了无人车与虚拟车的控制。该系统可以在真实场景、虚拟场景、并行系统和虚实融合场景中进行多模态交互，完成多项任务。最后，通过实验验证了系统的稳定性和可靠性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 IEEE International Conference on Networking, Sensing and Control (ICNSC)

自引率

0.00%

发文量