Georges Quénot, T. Tan, V. Le, S. Ayache, Laurent Besacier, Philippe Mulhem
{"title":"多语言视听材料的内容搜索","authors":"Georges Quénot, T. Tan, V. Le, S. Ayache, Laurent Besacier, Philippe Mulhem","doi":"10.24348/coria.2009.67","DOIUrl":null,"url":null,"abstract":"ABSTRACT. We present in this paper an approach based on the use of the International PhoneticAlphabet (IPA) for content-based indexing and retrieval of multilingual audiovisual documents.The approach works even if the languages of the document are unknown. It has been validatedin the context of the “Star Challenge” search engine competition organized by the A*STARAgency of Singapore. Our approach includes the building of an IPA-based multilingual acousticmodel and a dynamic programming based method for searching document segments by “IPAstring spotting”. Dynamic programming allows for retrieving the query string in the documentstring even with a significant transcription error rate at the phone level. The methods that wedeveloped ranked us as first and third on the monolingual (English) search task, as fifth on themultilingual search task and as first on the multimodal (audio and image) search task. MOTS-CLES : Recherche audio, Multilingue, Alphabet Phonetique International, ProgrammationDynamique, Star Challenge","PeriodicalId":390974,"journal":{"name":"Conférence en Recherche d'Infomations et Applications","volume":"4 9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Recherche par le contenu dans des documents audiovisuels multilingues\",\"authors\":\"Georges Quénot, T. Tan, V. Le, S. Ayache, Laurent Besacier, Philippe Mulhem\",\"doi\":\"10.24348/coria.2009.67\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"ABSTRACT. We present in this paper an approach based on the use of the International PhoneticAlphabet (IPA) for content-based indexing and retrieval of multilingual audiovisual documents.The approach works even if the languages of the document are unknown. It has been validatedin the context of the “Star Challenge” search engine competition organized by the A*STARAgency of Singapore. Our approach includes the building of an IPA-based multilingual acousticmodel and a dynamic programming based method for searching document segments by “IPAstring spotting”. Dynamic programming allows for retrieving the query string in the documentstring even with a significant transcription error rate at the phone level. The methods that wedeveloped ranked us as first and third on the monolingual (English) search task, as fifth on themultilingual search task and as first on the multimodal (audio and image) search task. MOTS-CLES : Recherche audio, Multilingue, Alphabet Phonetique International, ProgrammationDynamique, Star Challenge\",\"PeriodicalId\":390974,\"journal\":{\"name\":\"Conférence en Recherche d'Infomations et Applications\",\"volume\":\"4 9 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-08-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Conférence en Recherche d'Infomations et Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.24348/coria.2009.67\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Conférence en Recherche d'Infomations et Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.24348/coria.2009.67","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
摘要
摘要我们在本文中提出了一种基于使用国际音标(IPA)的方法,用于基于内容的索引和多语言视听文档的检索。即使文档的语言是未知的,这种方法也有效。在新加坡A*STARAgency组织的“星之挑战”搜索引擎大赛中得到了验证。我们的方法包括建立一个基于ipaa的多语言声学模型和一个基于动态规划的“IPAstring定位”搜索文档片段的方法。动态编程允许在documentstring中检索查询字符串,即使在电话级别上存在显著的转录错误率。我们开发的方法在单语言(英语)搜索任务中排名第一和第三,在多语言搜索任务中排名第五,在多模式(音频和图像)搜索任务中排名第一。MOTS-CLES: Recherche audio, multilingual, Alphabet Phonetique International, programationdynamique, Star Challenge
Recherche par le contenu dans des documents audiovisuels multilingues
ABSTRACT. We present in this paper an approach based on the use of the International PhoneticAlphabet (IPA) for content-based indexing and retrieval of multilingual audiovisual documents.The approach works even if the languages of the document are unknown. It has been validatedin the context of the “Star Challenge” search engine competition organized by the A*STARAgency of Singapore. Our approach includes the building of an IPA-based multilingual acousticmodel and a dynamic programming based method for searching document segments by “IPAstring spotting”. Dynamic programming allows for retrieving the query string in the documentstring even with a significant transcription error rate at the phone level. The methods that wedeveloped ranked us as first and third on the monolingual (English) search task, as fifth on themultilingual search task and as first on the multimodal (audio and image) search task. MOTS-CLES : Recherche audio, Multilingue, Alphabet Phonetique International, ProgrammationDynamique, Star Challenge