{"title":"DNA-CBIR: DNA Translation Inspired Codon Pattern-based Deep Image Feature Extraction for Content-based Image Retrieval.","authors":"Jitesh Pradhan, Hathiram Nenavath","doi":"10.1109/TNB.2025.3540102","DOIUrl":null,"url":null,"abstract":"<p><p>DNA is emerging as a promising medium for storing huge volumes of data in a confined space that remains intact for thousands of years. Although this technique is very efficient, especially for multimedia data like images, there is a lack of efficient searching and retrieval technique. This paper addresses this issue and proposes a novel Content Based Image Retrieval (CBIR) technique to retrieve similar images from the generated DNA-based image feature vectors. The features are obtained by a novel encoding scheme that uses the three Most-Significant Bits from the images and converts them into a string of nucleotides that follow run length and GC constraints to form DNA planes stored in a DNA medium. The nucleotides in these planes are interpreted through three consecutive sequences forming codons. The codon-based features are then utilized to perform instance-based image retrieval. The DNA planes are further adapted and implemented on diverse deep learning architectures, including ResNet-50, VGG-16, VGG-19, and Inception V3, to facilitate classification-based image retrieval tasks. The system's performance has been assessed using a range of datasets, encompassing coral, medical, and multi-label images. Experimental results demonstrate that the proposed approach achieves notable improvements when compared to existing state-of-the-art methods.</p>","PeriodicalId":13264,"journal":{"name":"IEEE Transactions on NanoBioscience","volume":"PP ","pages":""},"PeriodicalIF":3.7000,"publicationDate":"2025-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on NanoBioscience","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1109/TNB.2025.3540102","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
DNA is emerging as a promising medium for storing huge volumes of data in a confined space that remains intact for thousands of years. Although this technique is very efficient, especially for multimedia data like images, there is a lack of efficient searching and retrieval technique. This paper addresses this issue and proposes a novel Content Based Image Retrieval (CBIR) technique to retrieve similar images from the generated DNA-based image feature vectors. The features are obtained by a novel encoding scheme that uses the three Most-Significant Bits from the images and converts them into a string of nucleotides that follow run length and GC constraints to form DNA planes stored in a DNA medium. The nucleotides in these planes are interpreted through three consecutive sequences forming codons. The codon-based features are then utilized to perform instance-based image retrieval. The DNA planes are further adapted and implemented on diverse deep learning architectures, including ResNet-50, VGG-16, VGG-19, and Inception V3, to facilitate classification-based image retrieval tasks. The system's performance has been assessed using a range of datasets, encompassing coral, medical, and multi-label images. Experimental results demonstrate that the proposed approach achieves notable improvements when compared to existing state-of-the-art methods.
期刊介绍:
The IEEE Transactions on NanoBioscience reports on original, innovative and interdisciplinary work on all aspects of molecular systems, cellular systems, and tissues (including molecular electronics). Topics covered in the journal focus on a broad spectrum of aspects, both on foundations and on applications. Specifically, methods and techniques, experimental aspects, design and implementation, instrumentation and laboratory equipment, clinical aspects, hardware and software data acquisition and analysis and computer based modelling are covered (based on traditional or high performance computing - parallel computers or computer networks).