{"title":"Foreign language audio information management system","authors":"Marc Shichman, M. Gaffney, E. Fake, L. Sokol","doi":"10.1109/ICIF.2002.1020993","DOIUrl":null,"url":null,"abstract":"Veridian created a prototype of a foreign language audio information management system that integrates speech recognition technology, machine translation and advanced information retrieval and extraction for Mandarin Chinese. The system automatically processes audio recordings to create a data warehouse of derived information using speech recognition and machine translation technology components. The data warehouse can then be further exploited using information retrieval technology components. The prototype system provides the following capabilities: Automatically transforming foreign audio files into electronic text; Transforming foreign text into English text; Matching transcribed and translated text and the topics of interest to the analyst; Displaying transcribed text by speaker. The conclusions imply that while automatic speech processing technology is far from perfect for mass market distribution, it is sufficiently advanced to help with the overload of audio and video data.","PeriodicalId":399150,"journal":{"name":"Proceedings of the Fifth International Conference on Information Fusion. FUSION 2002. (IEEE Cat.No.02EX5997)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2002-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Fifth International Conference on Information Fusion. FUSION 2002. (IEEE Cat.No.02EX5997)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIF.2002.1020993","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Veridian created a prototype of a foreign language audio information management system that integrates speech recognition technology, machine translation and advanced information retrieval and extraction for Mandarin Chinese. The system automatically processes audio recordings to create a data warehouse of derived information using speech recognition and machine translation technology components. The data warehouse can then be further exploited using information retrieval technology components. The prototype system provides the following capabilities: Automatically transforming foreign audio files into electronic text; Transforming foreign text into English text; Matching transcribed and translated text and the topics of interest to the analyst; Displaying transcribed text by speaker. The conclusions imply that while automatic speech processing technology is far from perfect for mass market distribution, it is sufficiently advanced to help with the overload of audio and video data.