English to Tamil Multi-Modal Image Captioning Translation
V. H. Vishnu Kumar, N. Lalithamani
2022 IEEE World Conference on Applied Intelligence and Computing (AIC), published 2022-06-17
DOI: 10.1109/AIC55036.2022.9848810 (https://doi.org/10.1109/AIC55036.2022.9848810)
Citation count: 1
Abstract
Machine translation bridges the gap between new technologies and local languages, and it has had a large impact on the many non-English speakers who would otherwise lack access to technologies available only in English. Image captioning is one such emerging research field that calls for translation. With the potential to reach 75 million Tamil speakers worldwide, we have created a one-of-a-kind Tamil image captioning dataset, translated from Microsoft's Common Objects in Context (COCO) dataset. Using this dataset, we explore several multi-modal architectures that generate captions directly in Tamil for any given input image. The generated captions are discussed and evaluated using the popular Bilingual Evaluation Understudy (BLEU) score.
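
For readers unfamiliar with the metric, the following is a minimal sketch of sentence-level BLEU: clipped n-gram precision for n = 1 to 4, combined by a geometric mean and scaled by a brevity penalty. It is not the authors' evaluation code. The abstract does not state which BLEU variant, smoothing, or tokenizer was used, so the function names `ngrams` and `sentence_bleu`, the uniform weights, the absence of smoothing, and the pre-tokenized input are all assumptions made for illustration.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of all n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def sentence_bleu(candidate, references, max_n=4):
    """Unsmoothed sentence-level BLEU with uniform n-gram weights.

    candidate: list of tokens; references: list of token lists.
    Hypothetical helper for illustration, not the paper's implementation.
    """
    if not candidate:
        return 0.0
    log_precisions = []
    for n in range(1, max_n + 1):
        cand_counts = ngrams(candidate, n)
        total = sum(cand_counts.values())
        if total == 0:
            return 0.0  # candidate is shorter than n tokens
        # Clip each candidate n-gram count by its maximum count in any reference.
        max_ref = Counter()
        for ref in references:
            for gram, count in ngrams(ref, n).items():
                max_ref[gram] = max(max_ref[gram], count)
        clipped = sum(min(count, max_ref[gram]) for gram, count in cand_counts.items())
        if clipped == 0:
            return 0.0  # without smoothing, one zero precision zeroes the score
        log_precisions.append(math.log(clipped / total))
    # Brevity penalty uses the reference length closest to the candidate length.
    c = len(candidate)
    r = min((abs(len(ref) - c), len(ref)) for ref in references)[1]
    bp = 1.0 if c > r else math.exp(1 - r / c)
    return bp * math.exp(sum(log_precisions) / max_n)


# Placeholder tokens only; real use would pass tokenized Tamil captions.
score = sentence_bleu("the cat sat on the mat".split(),
                      ["the cat sat on a mat".split()])
```

Whitespace splitting is used here only to keep the sketch short. Tamil is morphologically rich, so the choice of tokenizer can change BLEU scores noticeably, and scores are comparable only when the tokenization is held fixed.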