Language Integration in Remote Sensing: Tasks, datasets, and future directions

Impact factor: 16.4 · Zone 1 (Earth Sciences) · JCR Q1 (Geochemistry & Geophysics)
Laila Bashmal, Yakoub Bazi, Farid Melgani, Mohamad M. Al Rahhal, Mansour Abdulaziz Al Zuair
DOI: 10.1109/mgrs.2023.3316438 (https://doi.org/10.1109/mgrs.2023.3316438)
Journal: IEEE Geoscience and Remote Sensing Magazine
Publication date: 2023-01-01 · Publication type: Journal Article

Abstract

Vision–language modeling, an emerging field that combines computer vision and natural language processing (NLP), has attracted significant interest. This integration has opened up new research opportunities, particularly in remote sensing (RS), where it has the potential to enhance the capabilities of RS systems. In this context, this article presents a comprehensive review of more than 100 articles on the integration of NLP techniques into RS understanding research. The review covers a range of vision–language modeling tasks, including but not limited to RS image captioning, RS text-to-image retrieval, RS visual question answering (VQA), and RS image generation. For each task, it summarizes state-of-the-art developments, including methods, evaluation metrics, datasets, and experimental results on benchmark datasets. The review concludes by discussing key challenges and highlighting potential research directions, with the aim of inspiring further research in this important field.
Source journal
IEEE Geoscience and Remote Sensing Magazine (Computer Science: General Computer Science)
CiteScore: 20.50 · Self-citation rate: 2.70% · Articles published: 58
Journal description: The IEEE Geoscience and Remote Sensing Magazine (GRSM) serves as an informative platform, keeping readers abreast of activities within the IEEE GRS Society, its technical committees, and chapters. In addition to updating readers on society-related news, GRSM plays a crucial role in educating and informing its audience through various channels. These include: Technical Papers, International Remote Sensing Activities, Contributions on Education Activities, Industrial and University Profiles, Conference News, Book Reviews, and a Calendar of Important Events.