{"title":"一个端到端插值自动语音识别系统,带有标点符号的印度语文本","authors":"S. Joshi, V. Kumar","doi":"10.1109/ICCCS51487.2021.9776324","DOIUrl":null,"url":null,"abstract":"The Automatic Speech Recognition System (ASR) produces transcripts that often are misinterpreted and confuses the reader due to lack of context and punctuations. The presence of punctuation in the text improves readability and helps in better cognitive understanding. A wide variety of work has been done on English but Hindi which is the third-largest spoken language in the world, after English and Mandarin, still remains in the shadows. This paper aims to extend the technology to a wider section and introduces an end-to-end system, interpolating an automatic speech recognition system and natural language processing to produce high-quality punctuated transcriptions for the Hindi language. An ASR is implemented using the Kaldi toolkit leveraging the hybrid deep neural networks and the punctuation restoration is done with Bidirectional RNNs with an attention network.","PeriodicalId":120389,"journal":{"name":"2021 6th International Conference on Computing, Communication and Security (ICCCS)","volume":"412 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An end-to-end interpolated Automatic speech recognition system with punctuated transcripts for the Hindi language\",\"authors\":\"S. Joshi, V. Kumar\",\"doi\":\"10.1109/ICCCS51487.2021.9776324\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The Automatic Speech Recognition System (ASR) produces transcripts that often are misinterpreted and confuses the reader due to lack of context and punctuations. The presence of punctuation in the text improves readability and helps in better cognitive understanding. A wide variety of work has been done on English but Hindi which is the third-largest spoken language in the world, after English and Mandarin, still remains in the shadows. This paper aims to extend the technology to a wider section and introduces an end-to-end system, interpolating an automatic speech recognition system and natural language processing to produce high-quality punctuated transcriptions for the Hindi language. An ASR is implemented using the Kaldi toolkit leveraging the hybrid deep neural networks and the punctuation restoration is done with Bidirectional RNNs with an attention network.\",\"PeriodicalId\":120389,\"journal\":{\"name\":\"2021 6th International Conference on Computing, Communication and Security (ICCCS)\",\"volume\":\"412 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 6th International Conference on Computing, Communication and Security (ICCCS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCCS51487.2021.9776324\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 6th International Conference on Computing, Communication and Security (ICCCS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCS51487.2021.9776324","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An end-to-end interpolated Automatic speech recognition system with punctuated transcripts for the Hindi language
The Automatic Speech Recognition System (ASR) produces transcripts that often are misinterpreted and confuses the reader due to lack of context and punctuations. The presence of punctuation in the text improves readability and helps in better cognitive understanding. A wide variety of work has been done on English but Hindi which is the third-largest spoken language in the world, after English and Mandarin, still remains in the shadows. This paper aims to extend the technology to a wider section and introduces an end-to-end system, interpolating an automatic speech recognition system and natural language processing to produce high-quality punctuated transcriptions for the Hindi language. An ASR is implemented using the Kaldi toolkit leveraging the hybrid deep neural networks and the punctuation restoration is done with Bidirectional RNNs with an attention network.