A survey of technologies supporting design of a multimodal interactive robot for military communication

Q3 Decision Sciences
Sheuli Paul
Journal: Journal of Defense Analytics and Logistics
Published: 2023-11-13
DOI: 10.1108/jdal-11-2022-0010 (https://doi.org/10.1108/jdal-11-2022-0010)
Citation count: 0

Abstract

Purpose: This paper surveys research into interactive robotic systems in order to identify state-of-the-art capabilities and the extant gaps in this emerging field. Communication is multimodal. Multimodality is a representation of many modes, chosen from rhetorical aspects for their communication potential. The author seeks to define the available automation capabilities in multimodal communication that will support a proposed Interactive Robot System (IRS), an AI-mounted robotic platform intended to advance the speed and quality of military operational and tactical decision making.

Design/methodology/approach: The review begins by presenting key developments in the robotic interaction field, with the objective of identifying the essential technological developments that set conditions for robotic platforms to function autonomously. After surveying the key aspects of Human Robot Interaction (HRI), Unmanned Autonomous System (UAS), visualization, Virtual Environment (VE) and prediction, the paper describes the gaps in the application areas that will require extension and integration to enable prototyping of the IRS. A brief examination of other work in HRI-related fields concludes with a recapitulation of the IRS challenge that will set conditions for future success.

Findings: Using insights from a balanced cross section of sources from the government, academic and commercial entities that contribute to HRI, a multimodal IRS in military communication is introduced. A multimodal IRS (MIRS) in military communication has yet to be deployed.

Research limitations/implications: The multimodal robotic interface for the MIRS is an interdisciplinary endeavour. It is not realistic to expect one person to comprehend all the expert and related knowledge and skills needed to design and develop such a multimodal interactive robotic interface. In this brief preliminary survey, the author discusses extant AI, robotics, NLP, CV, VDM and VE applications that are directly related to multimodal interaction. Each mode of this multimodal communication is an active research area. Multimodal human/military robot communication is the ultimate goal of this research.

Practical implications: A multimodal autonomous robot in military communication using speech, images, gestures, VST and VE has yet to be deployed. Autonomous multimodal communication is expected to open wider possibilities for all armed forces. Given the density of the land domain, the army is in a position to exploit the opportunities for human–machine teaming (HMT) exposure. Naval and air forces will adopt platform-specific suites for specially selected operators to integrate with and leverage this emerging technology. Possessing a flexible means of communication that readily adapts to virtual training will greatly enhance planning and mission rehearsals.

Social implications: A multimodal communication system based on interaction, perception, cognition and visualization is still missing. The ability to communicate, express and convey information in an HMT setting, supported by multiple options, suggestions and recommendations, will certainly enhance military communication, strength, engagement, security, cognition and perception, as well as the ability to act confidently for a successful mission.

Originality/value: The objective is to develop a multimodal autonomous interactive robot for military communications. This survey reports the state of the art: what exists and what is missing, what can be done, and the possibilities of extension that support the military in maintaining effective communication using multimodalities. Progress is under way separately in areas such as machine-enabled speech, image recognition, tracking, visualizations for situational awareness, and virtual environments. At this time, there is no integrated approach to multimodal human robot interaction that offers flexible and agile communication. The report briefly introduces the research proposal for a multimodal interactive robot in military communication.
Source journal metrics: CiteScore 0.90; self-citation rate 0.00%; articles published 5; review time 12 weeks.