{"title":"English to Tamil Multi-Modal Image Captioning Translation","authors":"V. H. Vishnu Kumar, N. Lalithamani","doi":"10.1109/AIC55036.2022.9848810","journal":{"name":"2022 IEEE World Conference on Applied Intelligence and Computing (AIC)"},"publicationDate":"2022-06-17","citationCount":"1","platform":"Semanticscholar","ListUrlMain":"https://doi.org/10.1109/AIC55036.2022.9848810"}
English to Tamil Multi-Modal Image Captioning Translation
Machine Translation bridges the gap between new technologies and local languages, and it has had a large impact on the many non-English speakers who otherwise lack access to technologies available only in English. Image Captioning is one such research area that calls for translation. With the potential to reach 75 million Tamil speakers worldwide, we have created a unique Tamil Image Captioning Dataset, translated from Microsoft's Common Objects in Context (COCO) dataset, for captioning images in Tamil. Using this dataset, this work explores several multi-modal architectures that generate Tamil captions directly for any given input image. The generated captions are discussed and evaluated using the popular Bilingual Evaluation Understudy (BLEU) score.
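To make the evaluation metric concrete, the following is a minimal sketch of sentence-level BLEU: clipped n-gram precision combined with a brevity penalty. The abstract does not say which BLEU implementation, n-gram order, smoothing, or tokenization was used, so the function names `ngrams` and `bleu`, the uniform weights up to 4-grams, the lack of smoothing, and the whitespace-style token lists are all assumptions made for illustration, not the authors' setup.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of all n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def bleu(candidate, references, max_n=4):
    """Sentence-level BLEU with uniform weights and no smoothing (illustrative)."""
    log_precisions = []
    for n in range(1, max_n + 1):
        cand_counts = ngrams(candidate, n)
        # For each n-gram, record its maximum count across all references.
        max_ref = Counter()
        for ref in references:
            for gram, count in ngrams(ref, n).items():
                max_ref[gram] = max(max_ref[gram], count)
        # Clip candidate counts so repeated n-grams are not over-rewarded.
        clipped = sum(min(count, max_ref[gram]) for gram, count in cand_counts.items())
        total = sum(cand_counts.values())
        if total == 0 or clipped == 0:
            return 0.0  # without smoothing, any zero precision gives BLEU 0
        log_precisions.append(math.log(clipped / total))
    # Brevity penalty: compare against the reference length closest to the candidate.
    c = len(candidate)
    r = min((len(ref) for ref in references), key=lambda length: (abs(length - c), length))
    bp = 1.0 if c > r else math.exp(1 - r / c)
    # Geometric mean of the n-gram precisions, scaled by the brevity penalty.
    return bp * math.exp(sum(log_precisions) / max_n)


if __name__ == "__main__":
    # Placeholder tokens; real use would pass tokenized Tamil captions.
    reference = ["the", "cat", "sat", "on", "a", "mat"]
    generated = ["the", "cat", "sat", "on", "the", "mat"]
    print(bleu(generated, [reference]))
```

A generated caption identical to its reference scores 1.0, and one sharing no words with it scores 0.0. Because short captions often have no matching 4-grams, practical evaluations commonly apply smoothing or report corpus-level BLEU instead; whether the paper does so is not stated in the abstract.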