Dark‐Mode Human–Machine Communication Realized by Persistent Luminescence and Deep Learning

Advanced Intelligent Systems Pub Date : 2022-04-29 DOI:10.1002/aisy.202200036

Suman Timilsina, Hoonjae Shin, K. Sohn, Ji Sik Kim

{"title":"Dark‐Mode Human–Machine Communication Realized by Persistent Luminescence and Deep Learning","authors":"Suman Timilsina, Hoonjae Shin, K. Sohn, Ji Sik Kim","doi":"10.1002/aisy.202200036","DOIUrl":null,"url":null,"abstract":"Increasing ubiquitous collaborative intelligence between humans and machines requires human–machine communication (HMC) that is more human and less machine‐like to accomplish given tasks. Although speech signals are considered the best modes of communication in HMC, background noise often interferes with these signals. Therefore, research focused on integrating lip‐reading technology into HMC has gained significant attention. However, lip‐reading functions effectively only in well‐lit environments. In contrast, HMC may occur daily in dark environments owing to potential energy shortages, increased exploration in darkness, nighttime emergencies, etc. Herein, a possible method for HMC in the dark mode is presented, which is realized based on deep learning motion patterns of persistent luminescence (PL) of the skin surrounding the lips. An ultrasoft PL–polymer composite patch is used to record the motion pattern of the skin during speech in the dark. It is found that visual geometric group network (VGGNET‐5) and residual neural network (ResNet‐34) could predict spoken words in darkness with test accuracies of 98.5% and 98.75%, respectively. Furthermore, these models could effectively distinguish similar‐sounding words such as “around” and “ground.” Dark‐mode communication can allow a wide range of people, including disabled people with limited dexterity and voice tremors, to communicate with artificial intelligence machines.","PeriodicalId":7187,"journal":{"name":"Advanced Intelligent Systems","volume":"207 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Intelligent Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/aisy.202200036","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

Increasing ubiquitous collaborative intelligence between humans and machines requires human–machine communication (HMC) that is more human and less machine‐like to accomplish given tasks. Although speech signals are considered the best modes of communication in HMC, background noise often interferes with these signals. Therefore, research focused on integrating lip‐reading technology into HMC has gained significant attention. However, lip‐reading functions effectively only in well‐lit environments. In contrast, HMC may occur daily in dark environments owing to potential energy shortages, increased exploration in darkness, nighttime emergencies, etc. Herein, a possible method for HMC in the dark mode is presented, which is realized based on deep learning motion patterns of persistent luminescence (PL) of the skin surrounding the lips. An ultrasoft PL–polymer composite patch is used to record the motion pattern of the skin during speech in the dark. It is found that visual geometric group network (VGGNET‐5) and residual neural network (ResNet‐34) could predict spoken words in darkness with test accuracies of 98.5% and 98.75%, respectively. Furthermore, these models could effectively distinguish similar‐sounding words such as “around” and “ground.” Dark‐mode communication can allow a wide range of people, including disabled people with limited dexterity and voice tremors, to communicate with artificial intelligence machines.

查看原文本刊更多论文

基于持续发光和深度学习的暗模式人机通信

越来越多的人与机器之间无处不在的协作智能要求人机通信(HMC)更人性化，更少机器化来完成给定的任务。虽然语音信号被认为是HMC中最好的通信方式，但背景噪声经常干扰这些信号。因此，将唇读技术融入HMC的研究得到了广泛的关注。然而，唇读只有在光线充足的环境下才能有效发挥作用。相比之下，HMC可能每天都在黑暗环境中发生，因为潜在的能源短缺、在黑暗中探险的增加、夜间突发事件等。在此基础上，提出了一种基于深度学习唇周皮肤持续发光(PL)运动模式的暗模式下HMC的实现方法。一个超软的pl -聚合物复合贴片被用来记录在黑暗中说话时皮肤的运动模式。研究发现，视觉几何群网络(VGGNET‐5)和残差神经网络(ResNet‐34)在黑暗环境下预测言语的准确率分别为98.5%和98.75%。此外，这些模型可以有效地区分发音相似的单词，如“around”和“ground”。暗模式通信可以让很多人与人工智能机器进行交流，包括手脚不灵活和声音颤抖的残疾人。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Advanced Intelligent Systems

自引率

0.00%

发文量