R. Rohit, SA Gandheesh, KS Suriya, Peeta Basa Pati
{"title":"基于深度网络语言模型的噪声文本识别精度提高系统","authors":"R. Rohit, SA Gandheesh, KS Suriya, Peeta Basa Pati","doi":"10.1109/I2CT57861.2023.10126194","DOIUrl":null,"url":null,"abstract":"Text from image documents must be recognized for its usage. Various tasks such as plagiarism & error check, language analysis, information capture rely on the accuracy of this text conversion. OCR systems convert the document images to their text equivalent. These OCR systems are prone to introducing errors during the recognition process.This work reports a system developed to ingest image documents which is converted to text using available OCR technologies. The recognized text, subsequently, is processed with deep network language models to enhance the accuracy of text. The system consists of a client server architecture with user interface available from web application as well as from mobile app. For the language models, encoder-decoder based BART & MarianMT are used. The results obtained demonstrate a 35% reduction in WER using the BART language model.","PeriodicalId":150346,"journal":{"name":"2023 IEEE 8th International Conference for Convergence in Technology (I2CT)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"System for Enhancing Accuracy of Noisy Text using Deep Network Language Models\",\"authors\":\"R. Rohit, SA Gandheesh, KS Suriya, Peeta Basa Pati\",\"doi\":\"10.1109/I2CT57861.2023.10126194\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Text from image documents must be recognized for its usage. Various tasks such as plagiarism & error check, language analysis, information capture rely on the accuracy of this text conversion. OCR systems convert the document images to their text equivalent. These OCR systems are prone to introducing errors during the recognition process.This work reports a system developed to ingest image documents which is converted to text using available OCR technologies. The recognized text, subsequently, is processed with deep network language models to enhance the accuracy of text. The system consists of a client server architecture with user interface available from web application as well as from mobile app. For the language models, encoder-decoder based BART & MarianMT are used. The results obtained demonstrate a 35% reduction in WER using the BART language model.\",\"PeriodicalId\":150346,\"journal\":{\"name\":\"2023 IEEE 8th International Conference for Convergence in Technology (I2CT)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-04-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE 8th International Conference for Convergence in Technology (I2CT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/I2CT57861.2023.10126194\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE 8th International Conference for Convergence in Technology (I2CT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/I2CT57861.2023.10126194","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
System for Enhancing Accuracy of Noisy Text using Deep Network Language Models
Text from image documents must be recognized for its usage. Various tasks such as plagiarism & error check, language analysis, information capture rely on the accuracy of this text conversion. OCR systems convert the document images to their text equivalent. These OCR systems are prone to introducing errors during the recognition process.This work reports a system developed to ingest image documents which is converted to text using available OCR technologies. The recognized text, subsequently, is processed with deep network language models to enhance the accuracy of text. The system consists of a client server architecture with user interface available from web application as well as from mobile app. For the language models, encoder-decoder based BART & MarianMT are used. The results obtained demonstrate a 35% reduction in WER using the BART language model.