Shubham Lotliker, Gouri Bhatikar, Avina Almeida, Ugam Gaude, S. Naik, V. Jog
{"title":"Podcast Hosting Using Spectral Gating And Speech Recognition Methodology","authors":"Shubham Lotliker, Gouri Bhatikar, Avina Almeida, Ugam Gaude, S. Naik, V. Jog","doi":"10.1109/RTEICT52294.2021.9573977","DOIUrl":null,"url":null,"abstract":"Podcasts contain information in an audio form which is being recorded in human voice. These recorded podcasts are later published on various podcast hosting websites where people can access and listen to various such podcasts. Podcasts can also be used to host an interview at a common location. But due to pandemic, the interviews are conducted through online platforms. The podcasts recorded via such platforms result in poor quality of the audio along with communication problems. To solve this problem, the paper focuses on building an application where interview-based podcasts can be conducted at different geographical locations while preserving the audio quality. Noise in the audio file is removed using Spectral Gating. Subtitles of audio file will also be generated using Speech Recognition algorithm. The final audio file will also generate an RSS feed link which can be used while publishing the podcast to notify subscribed users about the new updates. We have carried out our experiment on more than 100 audio files.","PeriodicalId":191410,"journal":{"name":"2021 International Conference on Recent Trends on Electronics, Information, Communication & Technology (RTEICT)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Recent Trends on Electronics, Information, Communication & Technology (RTEICT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RTEICT52294.2021.9573977","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Podcasts contain information in an audio form which is being recorded in human voice. These recorded podcasts are later published on various podcast hosting websites where people can access and listen to various such podcasts. Podcasts can also be used to host an interview at a common location. But due to pandemic, the interviews are conducted through online platforms. The podcasts recorded via such platforms result in poor quality of the audio along with communication problems. To solve this problem, the paper focuses on building an application where interview-based podcasts can be conducted at different geographical locations while preserving the audio quality. Noise in the audio file is removed using Spectral Gating. Subtitles of audio file will also be generated using Speech Recognition algorithm. The final audio file will also generate an RSS feed link which can be used while publishing the podcast to notify subscribed users about the new updates. We have carried out our experiment on more than 100 audio files.