结合大语言模型和视觉基础模型的物联网零样本人脸检索

IF 0.9 Q4 TELECOMMUNICATIONS
Jin Lu, Meifen Chen
{"title":"结合大语言模型和视觉基础模型的物联网零样本人脸检索","authors":"Jin Lu,&nbsp;Meifen Chen","doi":"10.1002/itl2.506","DOIUrl":null,"url":null,"abstract":"<p>This paper presents a novel approach to face retrieval that leverages the capabilities of large language models and visual base models, marking a significant departure from traditional IoT text retrieval methods that depend on extensive data collection and model training. By eliminating the need for text-image pair data collection and model training, our method not only dramatically reduces the data and computational costs associated with IoT applications but also achieves high accuracy in face retrieval, as demonstrated by a 72% top-1 accuracy and 93% top-3 accuracy on the Celeb-A dataset. This substantial improvement in efficiency and performance has profound implications for the future of IoT systems, potentially revolutionizing face recognition technology by enabling more scalable, cost-effective, and accurate solutions. The successful application of zero-sample face retrieval illustrates the transformative impact that advanced AI models can have on real-world applications and opens new avenues for research and development in the realm of intelligent IoT devices.</p>","PeriodicalId":100725,"journal":{"name":"Internet Technology Letters","volume":"8 1","pages":""},"PeriodicalIF":0.9000,"publicationDate":"2024-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Zero-sample face retrieval combining large language model and visual base model for IoT\",\"authors\":\"Jin Lu,&nbsp;Meifen Chen\",\"doi\":\"10.1002/itl2.506\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>This paper presents a novel approach to face retrieval that leverages the capabilities of large language models and visual base models, marking a significant departure from traditional IoT text retrieval methods that depend on extensive data collection and model training. By eliminating the need for text-image pair data collection and model training, our method not only dramatically reduces the data and computational costs associated with IoT applications but also achieves high accuracy in face retrieval, as demonstrated by a 72% top-1 accuracy and 93% top-3 accuracy on the Celeb-A dataset. This substantial improvement in efficiency and performance has profound implications for the future of IoT systems, potentially revolutionizing face recognition technology by enabling more scalable, cost-effective, and accurate solutions. The successful application of zero-sample face retrieval illustrates the transformative impact that advanced AI models can have on real-world applications and opens new avenues for research and development in the realm of intelligent IoT devices.</p>\",\"PeriodicalId\":100725,\"journal\":{\"name\":\"Internet Technology Letters\",\"volume\":\"8 1\",\"pages\":\"\"},\"PeriodicalIF\":0.9000,\"publicationDate\":\"2024-01-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Internet Technology Letters\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/itl2.506\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"TELECOMMUNICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Internet Technology Letters","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/itl2.506","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"TELECOMMUNICATIONS","Score":null,"Total":0}
引用次数: 0

摘要

本文提出了一种利用大型语言模型和视觉基础模型能力进行人脸检索的新方法,与依赖大量数据收集和模型训练的传统物联网文本检索方法大相径庭。我们的方法无需进行文本图像对数据收集和模型训练,不仅大大降低了与物联网应用相关的数据和计算成本,还实现了较高的人脸检索准确率,Celeb-A 数据集的前 1 位准确率为 72%,前 3 位准确率为 93%。效率和性能的大幅提升对物联网系统的未来有着深远的影响,通过实现更具可扩展性、成本效益和准确性的解决方案,有可能彻底改变人脸识别技术。零样本人脸检索的成功应用说明了先进的人工智能模型可以对现实世界的应用产生变革性影响,并为智能物联网设备领域的研究和开发开辟了新的途径。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Zero-sample face retrieval combining large language model and visual base model for IoT

This paper presents a novel approach to face retrieval that leverages the capabilities of large language models and visual base models, marking a significant departure from traditional IoT text retrieval methods that depend on extensive data collection and model training. By eliminating the need for text-image pair data collection and model training, our method not only dramatically reduces the data and computational costs associated with IoT applications but also achieves high accuracy in face retrieval, as demonstrated by a 72% top-1 accuracy and 93% top-3 accuracy on the Celeb-A dataset. This substantial improvement in efficiency and performance has profound implications for the future of IoT systems, potentially revolutionizing face recognition technology by enabling more scalable, cost-effective, and accurate solutions. The successful application of zero-sample face retrieval illustrates the transformative impact that advanced AI models can have on real-world applications and opens new avenues for research and development in the realm of intelligent IoT devices.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
3.10
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信