Podcast Hosting Using Spectral Gating And Speech Recognition Methodology

2021 International Conference on Recent Trends on Electronics, Information, Communication & Technology (RTEICT). Pub Date: 2021-08-27. DOI: 10.1109/RTEICT52294.2021.9573977

Shubham Lotliker, Gouri Bhatikar, Avina Almeida, Ugam Gaude, S. Naik, V. Jog

Abstract

Podcasts carry information as recorded human speech. Recorded podcasts are published on podcast hosting websites, where listeners can access them. Podcasts can also be used to host an interview with the participants at a common location. Because of the pandemic, however, interviews are conducted through online platforms, and podcasts recorded this way suffer from poor audio quality and communication problems. To address this, the paper builds an application that lets interview-based podcasts be conducted from different geographical locations while preserving audio quality. Noise in the audio file is removed using spectral gating, and subtitles are generated from the audio using a speech recognition algorithm. The application also generates an RSS feed link for the final audio file, which can be used when publishing the podcast to notify subscribers of new updates. The experiments were carried out on more than 100 audio files.
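The abstract does not describe how the spectral gating step is implemented, so the following is only a minimal sketch of the general technique, not the authors' code. It assumes a separate noise-only clip is available, estimates a per-frequency threshold from it (mean magnitude plus `n_std` standard deviations), and zeroes every time-frequency bin of the noisy signal that falls below that threshold. The names `spectral_gate`, `frame_len`, and `n_std` are hypothetical, and a naive DFT is used only to keep the example dependency-free; a practical version would use an FFT library and usually a smoothed, partial attenuation instead of a hard gate.

```python
import cmath
import math
import random


def hann(n):
    """Periodic Hann window; copies shifted by n/2 sum to exactly 1."""
    return [0.5 - 0.5 * math.cos(2 * math.pi * t / n) for t in range(n)]


def dft(x):
    """Naive O(n^2) discrete Fourier transform (fine for short frames)."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft(spec):
    """Inverse of dft(); keeps only the real part of each sample."""
    n = len(spec)
    return [sum(spec[k] * cmath.exp(2j * math.pi * k * t / n)
                for k in range(n)).real / n
            for t in range(n)]


def stft(signal, frame_len):
    """Spectra of Hann-windowed frames that overlap by half a frame."""
    hop = frame_len // 2
    win = hann(frame_len)
    return [dft([s * w for s, w in zip(signal[i:i + frame_len], win)])
            for i in range(0, len(signal) - frame_len + 1, hop)]


def spectral_gate(noisy, noise_clip, frame_len=64, n_std=1.5):
    """Hard spectral gate (illustrative; parameter names are assumptions)."""
    hop = frame_len // 2

    # 1. Noise profile: per-bin mean and std of magnitude in the noise clip.
    noise_mags = [[abs(c) for c in frame] for frame in stft(noise_clip, frame_len)]
    thresholds = []
    for k in range(frame_len):
        col = [mags[k] for mags in noise_mags]
        mean = sum(col) / len(col)
        std = math.sqrt(sum((v - mean) ** 2 for v in col) / len(col))
        thresholds.append(mean + n_std * std)

    # 2. Zero-pad so every input sample is covered by two overlapping frames.
    padded = [0.0] * hop + list(noisy) + [0.0] * (hop + (-len(noisy)) % hop)
    out = [0.0] * len(padded)

    # 3. Zero the bins below the threshold, then overlap-add the frames.
    for idx, spec in enumerate(stft(padded, frame_len)):
        gated = [c if abs(c) > thr else 0 for c, thr in zip(spec, thresholds)]
        for t, v in enumerate(idft(gated)):
            out[idx * hop + t] += v

    return out[hop:hop + len(noisy)]


if __name__ == "__main__":
    random.seed(0)
    clean = [math.sin(2 * math.pi * t / 8) for t in range(1024)]
    noisy = [c + random.gauss(0, 0.2) for c in clean]
    noise_clip = [random.gauss(0, 0.2) for _ in range(512)]
    denoised = spectral_gate(noisy, noise_clip)
    print(len(denoised))
```

The hard gate keeps the sketch short, but it tends to leave isolated noisy bins audible as "musical noise"; smoothing the mask over time and frequency is a common refinement.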
