J. Pérez-Cortes, R. Llobet, J. Navarro-Cerdán, J. Arlandis
{"title":"利用场相互依赖提高基于传感器的OCR后处理系统的校正性能","authors":"J. Pérez-Cortes, R. Llobet, J. Navarro-Cerdán, J. Arlandis","doi":"10.1109/ICFHR.2010.99","DOIUrl":null,"url":null,"abstract":"In an automatic handwritten form processing system it is often necessary to use the lexical or linguistic restrictions present in the field contents in order to obtain acceptable recognition rates. Since each field is known to hold a given kind of information (name, address...), a language model can be defined for it. But, often, in a typical form there are fields linked by known relations, like “Street” and “Postal Code” or “Country” and “City”. We have used Weighted Finite-State Transducers (WFSTs) to combine Stochastic Error-Correcting Language Models from different interdependent fields in real handwritten forms and measured the improvements obtained.","PeriodicalId":335044,"journal":{"name":"2010 12th International Conference on Frontiers in Handwriting Recognition","volume":"320 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Using Field Interdependence to Improve Correction Performance in a Transducer-Based OCR Post-Processing System\",\"authors\":\"J. Pérez-Cortes, R. Llobet, J. Navarro-Cerdán, J. Arlandis\",\"doi\":\"10.1109/ICFHR.2010.99\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In an automatic handwritten form processing system it is often necessary to use the lexical or linguistic restrictions present in the field contents in order to obtain acceptable recognition rates. Since each field is known to hold a given kind of information (name, address...), a language model can be defined for it. But, often, in a typical form there are fields linked by known relations, like “Street” and “Postal Code” or “Country” and “City”. We have used Weighted Finite-State Transducers (WFSTs) to combine Stochastic Error-Correcting Language Models from different interdependent fields in real handwritten forms and measured the improvements obtained.\",\"PeriodicalId\":335044,\"journal\":{\"name\":\"2010 12th International Conference on Frontiers in Handwriting Recognition\",\"volume\":\"320 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-11-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 12th International Conference on Frontiers in Handwriting Recognition\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICFHR.2010.99\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 12th International Conference on Frontiers in Handwriting Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICFHR.2010.99","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Using Field Interdependence to Improve Correction Performance in a Transducer-Based OCR Post-Processing System
In an automatic handwritten form processing system it is often necessary to use the lexical or linguistic restrictions present in the field contents in order to obtain acceptable recognition rates. Since each field is known to hold a given kind of information (name, address...), a language model can be defined for it. But, often, in a typical form there are fields linked by known relations, like “Street” and “Postal Code” or “Country” and “City”. We have used Weighted Finite-State Transducers (WFSTs) to combine Stochastic Error-Correcting Language Models from different interdependent fields in real handwritten forms and measured the improvements obtained.