Sally Eltenahy, Nihal Fayez, M. Obayya, F. Khalifa
{"title":"基于webbrtc和Web语音api的视频会议语音文本转换","authors":"Sally Eltenahy, Nihal Fayez, M. Obayya, F. Khalifa","doi":"10.1109/ITC-Egypt52936.2021.9513968","DOIUrl":null,"url":null,"abstract":"The spread of coronavirus disease necessitates a significant demand for videoconference meetings increases, such as work meetings, eLearning, healthcare. Moreover, some of the participants in the videoconference may prefer to read the speech of the other participants as a text. This paper presents a solution that converts the speech of videoconference participants in to text in real time. The proposed solution depends on modern browsers JavaScript application programming interfaces (APIs): Web Real-Time Communication (WebRTC) APIs and Web Speech API that enable developers to develop applications for video conferences, chatting, sharing desktop, or sharing data in real-time between browsers, and also convert speech to text in real time. The proposed solution has used OpenVidu framework, that is an open-source videoconference application based on WebRTC technology. Then, adding the feature of speech to text by developing the frontend of OpenVidu framework by integration with Web Speech API.","PeriodicalId":321025,"journal":{"name":"2021 International Telecommunications Conference (ITC-Egypt)","volume":"2011 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Conversion of Videoconference Speech into Text based on WebRTC and Web Speech APIs\",\"authors\":\"Sally Eltenahy, Nihal Fayez, M. Obayya, F. Khalifa\",\"doi\":\"10.1109/ITC-Egypt52936.2021.9513968\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The spread of coronavirus disease necessitates a significant demand for videoconference meetings increases, such as work meetings, eLearning, healthcare. Moreover, some of the participants in the videoconference may prefer to read the speech of the other participants as a text. This paper presents a solution that converts the speech of videoconference participants in to text in real time. The proposed solution depends on modern browsers JavaScript application programming interfaces (APIs): Web Real-Time Communication (WebRTC) APIs and Web Speech API that enable developers to develop applications for video conferences, chatting, sharing desktop, or sharing data in real-time between browsers, and also convert speech to text in real time. The proposed solution has used OpenVidu framework, that is an open-source videoconference application based on WebRTC technology. Then, adding the feature of speech to text by developing the frontend of OpenVidu framework by integration with Web Speech API.\",\"PeriodicalId\":321025,\"journal\":{\"name\":\"2021 International Telecommunications Conference (ITC-Egypt)\",\"volume\":\"2011 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-07-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Telecommunications Conference (ITC-Egypt)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ITC-Egypt52936.2021.9513968\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Telecommunications Conference (ITC-Egypt)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITC-Egypt52936.2021.9513968","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Conversion of Videoconference Speech into Text based on WebRTC and Web Speech APIs
The spread of coronavirus disease necessitates a significant demand for videoconference meetings increases, such as work meetings, eLearning, healthcare. Moreover, some of the participants in the videoconference may prefer to read the speech of the other participants as a text. This paper presents a solution that converts the speech of videoconference participants in to text in real time. The proposed solution depends on modern browsers JavaScript application programming interfaces (APIs): Web Real-Time Communication (WebRTC) APIs and Web Speech API that enable developers to develop applications for video conferences, chatting, sharing desktop, or sharing data in real-time between browsers, and also convert speech to text in real time. The proposed solution has used OpenVidu framework, that is an open-source videoconference application based on WebRTC technology. Then, adding the feature of speech to text by developing the frontend of OpenVidu framework by integration with Web Speech API.