Conversion of Videoconference Speech into Text based on WebRTC and Web Speech APIs

Sally Eltenahy, Nihal Fayez, M. Obayya, F. Khalifa
{"title":"Conversion of Videoconference Speech into Text based on WebRTC and Web Speech APIs","authors":"Sally Eltenahy, Nihal Fayez, M. Obayya, F. Khalifa","doi":"10.1109/ITC-Egypt52936.2021.9513968","DOIUrl":null,"url":null,"abstract":"The spread of coronavirus disease necessitates a significant demand for videoconference meetings increases, such as work meetings, eLearning, healthcare. Moreover, some of the participants in the videoconference may prefer to read the speech of the other participants as a text. This paper presents a solution that converts the speech of videoconference participants in to text in real time. The proposed solution depends on modern browsers JavaScript application programming interfaces (APIs): Web Real-Time Communication (WebRTC) APIs and Web Speech API that enable developers to develop applications for video conferences, chatting, sharing desktop, or sharing data in real-time between browsers, and also convert speech to text in real time. The proposed solution has used OpenVidu framework, that is an open-source videoconference application based on WebRTC technology. Then, adding the feature of speech to text by developing the frontend of OpenVidu framework by integration with Web Speech API.","PeriodicalId":321025,"journal":{"name":"2021 International Telecommunications Conference (ITC-Egypt)","volume":"2011 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Telecommunications Conference (ITC-Egypt)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITC-Egypt52936.2021.9513968","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

The spread of coronavirus disease necessitates a significant demand for videoconference meetings increases, such as work meetings, eLearning, healthcare. Moreover, some of the participants in the videoconference may prefer to read the speech of the other participants as a text. This paper presents a solution that converts the speech of videoconference participants in to text in real time. The proposed solution depends on modern browsers JavaScript application programming interfaces (APIs): Web Real-Time Communication (WebRTC) APIs and Web Speech API that enable developers to develop applications for video conferences, chatting, sharing desktop, or sharing data in real-time between browsers, and also convert speech to text in real time. The proposed solution has used OpenVidu framework, that is an open-source videoconference application based on WebRTC technology. Then, adding the feature of speech to text by developing the frontend of OpenVidu framework by integration with Web Speech API.
基于webbrtc和Web语音api的视频会议语音文本转换
随着冠状病毒的传播,对工作会议、电子学习、医疗保健等视频会议的需求大幅增加。此外,视频会议的一些参与者可能更喜欢将其他参与者的演讲作为文本阅读。本文提出了一种将视频会议参与者的语音实时转换为文本的解决方案。提出的解决方案依赖于现代浏览器的JavaScript应用程序编程接口(API): Web实时通信(WebRTC) API和Web语音API,使开发人员能够开发用于视频会议、聊天、共享桌面或在浏览器之间实时共享数据的应用程序,并且还可以将语音实时转换为文本。本方案采用OpenVidu框架,这是一个基于WebRTC技术的开源视频会议应用。然后,通过集成Web speech API,开发OpenVidu框架前端,为文本添加语音功能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信