Audiovisual Interactive Companionship; The Next Breakthrough in Computers

2020 IEEE International IOT, Electronics and Mechatronics Conference (IEMTRONICS) Pub Date : 2020-09-01 DOI:10.1109/IEMTRONICS51293.2020.9216341

Shahriar Khan

{"title":"Audiovisual Interactive Companionship; The Next Breakthrough in Computers","authors":"Shahriar Khan","doi":"10.1109/IEMTRONICS51293.2020.9216341","DOIUrl":null,"url":null,"abstract":"New developments in voice and facial video generation suggest we are at the verge of a new breakthrough in audio and visual interactive companionship. Today's computer-interactive voices (Alexa and Google Assistant) are largely repetitive, robot-like, and all-knowing, sometimes making them tiring and monotonous. The challenge is to better simulate the human voice and personality, with a face generating the voice. Covid and sprawling cities are making people increasingly isolated. A game or program on TV can be better enjoyed when accompanied by a computer companion. Other than companionship for entertainment, there can be companionship for therapy, such as at a hospital or at an old-age home. Personalities of famous people such as scientists, statesmen, actors, and sportsmen can be recreated. The computer voice can be a training ground for social etiquette for children and adults. A simulated baby voice can be used for training would-be parents. A simulated patient's voice can be used for training doctors and nurses. With videoconferencing being the new norm in these times of Covid, the simulated voice and video will appear more real than before. The bonding of humans with the computer voice raises ethical questions about whether this could become addictive, and whether details of the interaction can be used by the company and the government. Ominously, can the simulated voice of a child become the face of the government, using the data to profile or even arrest users? Could the government use the voice to inspire users to be hard working, law abiding, and tax-paying citizens? It is expected that market forces will prevail, and companionship with the computer voice and video will prevail. We must find ways to regulate computer voice, video and companionship for the greater good.","PeriodicalId":269697,"journal":{"name":"2020 IEEE International IOT, Electronics and Mechatronics Conference (IEMTRONICS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE International IOT, Electronics and Mechatronics Conference (IEMTRONICS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IEMTRONICS51293.2020.9216341","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

New developments in voice and facial video generation suggest we are at the verge of a new breakthrough in audio and visual interactive companionship. Today's computer-interactive voices (Alexa and Google Assistant) are largely repetitive, robot-like, and all-knowing, sometimes making them tiring and monotonous. The challenge is to better simulate the human voice and personality, with a face generating the voice. Covid and sprawling cities are making people increasingly isolated. A game or program on TV can be better enjoyed when accompanied by a computer companion. Other than companionship for entertainment, there can be companionship for therapy, such as at a hospital or at an old-age home. Personalities of famous people such as scientists, statesmen, actors, and sportsmen can be recreated. The computer voice can be a training ground for social etiquette for children and adults. A simulated baby voice can be used for training would-be parents. A simulated patient's voice can be used for training doctors and nurses. With videoconferencing being the new norm in these times of Covid, the simulated voice and video will appear more real than before. The bonding of humans with the computer voice raises ethical questions about whether this could become addictive, and whether details of the interaction can be used by the company and the government. Ominously, can the simulated voice of a child become the face of the government, using the data to profile or even arrest users? Could the government use the voice to inspire users to be hard working, law abiding, and tax-paying citizens? It is expected that market forces will prevail, and companionship with the computer voice and video will prevail. We must find ways to regulate computer voice, video and companionship for the greater good.

查看原文本刊更多论文

视听互动陪伴;计算机的下一个突破

语音和面部视频生成的新发展表明，我们正处于音频和视觉互动陪伴的新突破的边缘。今天的计算机交互语音(Alexa和Google Assistant)在很大程度上是重复的、机器人式的、无所不知的，有时会让人感到厌倦和单调。挑战在于如何更好地模拟人类的声音和个性，用一张脸来生成声音。新冠疫情和不断扩张的城市让人们越来越孤立。电视上的游戏或节目有电脑陪伴才能更好地享受。除了娱乐的陪伴，还有治疗的陪伴，比如在医院或养老院。科学家、政治家、演员和运动员等名人的个性可以被重现。电脑的声音可以成为儿童和成人社交礼仪的训练场。模拟婴儿的声音可以用来训练准父母。模拟病人的声音可以用来培训医生和护士。随着视频会议成为新冠疫情时期的新常态，模拟的语音和视频将比以前更加真实。人类与电脑声音的联系引发了一些伦理问题:这种联系是否会让人上瘾，以及这种互动的细节是否可以被公司和政府利用。不幸的是，一个孩子的模拟声音会成为政府的面孔，利用这些数据来分析甚至逮捕用户吗?政府能否利用这种声音激励用户成为勤奋、守法、纳税的公民?预计市场力量将占上风，与电脑的声音和视频相伴将占上风。为了更大的利益，我们必须找到方法来管理电脑的声音、视频和陪伴。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 IEEE International IOT, Electronics and Mechatronics Conference (IEMTRONICS)

自引率

0.00%

发文量