CAAI Transactions on Intelligence Technology最新文献

筛选
英文 中文
Implicit policy constraint for offline reinforcement learning 离线强化学习的隐性策略约束
IF 8.4 2区 计算机科学
CAAI Transactions on Intelligence Technology Pub Date : 2024-03-15 DOI: 10.1049/cit2.12304
Zhiyong Peng, Yadong Liu, Changlin Han, Zongtan Zhou
{"title":"Implicit policy constraint for offline reinforcement learning","authors":"Zhiyong Peng,&nbsp;Yadong Liu,&nbsp;Changlin Han,&nbsp;Zongtan Zhou","doi":"10.1049/cit2.12304","DOIUrl":"10.1049/cit2.12304","url":null,"abstract":"<p>Offline reinforcement learning (RL) aims to learn policies entirely from passively collected datasets, making it a data-driven decision method. One of the main challenges in offline RL is the distribution shift problem, which causes the algorithm to visit out-of-distribution (OOD) samples. The distribution shift can be mitigated by constraining the divergence between the target policy and the behaviour policy. However, this method can overly constrain the target policy and impair the algorithm's performance, as it does not directly distinguish between in-distribution and OOD samples. In addition, it is difficult to learn and represent multi-modal behaviour policy when the datasets are collected by several different behaviour policies. To overcome these drawbacks, the authors address the distribution shift problem by implicit policy constraints with energy-based models (EBMs) rather than explicitly modelling the behaviour policy. The EBM is powerful for representing complex multi-modal distributions as well as the ability to distinguish in-distribution samples and OODs. Experimental results show that their method significantly outperforms the explicit policy constraint method and other baselines. In addition, the learnt energy model can be used to indicate OOD visits and alert the possible failure.</p>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"9 4","pages":"973-981"},"PeriodicalIF":8.4,"publicationDate":"2024-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12304","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140237574","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GAN-MD: A myocarditis detection using multi-channel convolutional neural networks and generative adversarial network-based data augmentation GAN-MD:利用多通道卷积神经网络和基于生成对抗网络的数据增强技术检测心肌炎
IF 8.4 2区 计算机科学
CAAI Transactions on Intelligence Technology Pub Date : 2024-03-14 DOI: 10.1049/cit2.12307
Hengame Ahmadi Golilarz, Alireza Azadbar, Roohallah Alizadehsani, Juan Manuel Gorriz
{"title":"GAN-MD: A myocarditis detection using multi-channel convolutional neural networks and generative adversarial network-based data augmentation","authors":"Hengame Ahmadi Golilarz,&nbsp;Alireza Azadbar,&nbsp;Roohallah Alizadehsani,&nbsp;Juan Manuel Gorriz","doi":"10.1049/cit2.12307","DOIUrl":"10.1049/cit2.12307","url":null,"abstract":"<p>Myocarditis is a significant public health concern because of its potential to cause heart failure and sudden death. The standard invasive diagnostic method, endomyocardial biopsy, is typically reserved for cases with severe complications, limiting its widespread use. Conversely, non-invasive cardiac magnetic resonance (CMR) imaging presents a promising alternative for detecting and monitoring myocarditis, because of its high signal contrast that reveals myocardial involvement. To assist medical professionals via artificial intelligence, the authors introduce generative adversarial networks - multi discriminator (GAN-MD), a deep learning model that uses binary classification to diagnose myocarditis from CMR images. Their approach employs a series of convolutional neural networks (CNNs) that extract and combine feature vectors for accurate diagnosis. The authors suggest a novel technique for improving the classification precision of CNNs. Using generative adversarial networks (GANs) to create synthetic images for data augmentation, the authors address challenges such as mode collapse and unstable training. Incorporating a reconstruction loss into the GAN loss function requires the generator to produce images reflecting the discriminator features, thus enhancing the generated images' quality to more accurately replicate authentic data patterns. Moreover, combining this loss function with other regularisation methods, such as gradient penalty, has proven to further improve the performance of diverse GAN models. A significant challenge in myocarditis diagnosis is the imbalance of classification, where one class dominates over the other. To mitigate this, the authors introduce a focal loss-based training method that effectively trains the model on the minority class samples. The GAN-MD approach, evaluated on the Z-Alizadeh Sani myocarditis dataset, achieves superior results (F-measure 86.2%; geometric mean 91.0%) compared with other deep learning models and traditional machine learning methods.</p>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"9 4","pages":"866-878"},"PeriodicalIF":8.4,"publicationDate":"2024-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12307","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140245001","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cloud-based video streaming services: Trends, challenges, and opportunities 基于云的视频流服务:趋势、挑战和机遇
IF 5.1 2区 计算机科学
CAAI Transactions on Intelligence Technology Pub Date : 2024-03-14 DOI: 10.1049/cit2.12299
Tajinder Kumar, Purushottam Sharma, Jaswinder Tanwar, Hisham Alsghier, Shashi Bhushan, Hesham Alhumyani, Vivek Sharma, Ahmed I. Alutaibi
{"title":"Cloud-based video streaming services: Trends, challenges, and opportunities","authors":"Tajinder Kumar,&nbsp;Purushottam Sharma,&nbsp;Jaswinder Tanwar,&nbsp;Hisham Alsghier,&nbsp;Shashi Bhushan,&nbsp;Hesham Alhumyani,&nbsp;Vivek Sharma,&nbsp;Ahmed I. Alutaibi","doi":"10.1049/cit2.12299","DOIUrl":"10.1049/cit2.12299","url":null,"abstract":"<p>Cloud computing has drastically changed the delivery and consumption of live streaming content. The designs, challenges, and possible uses of cloud computing for live streaming are studied. A comprehensive overview of the technical and business issues surrounding cloud-based live streaming is provided, including the benefits of cloud computing, the various live streaming architectures, and the challenges that live streaming service providers face in delivering high-quality, real-time services. The different techniques used to improve the performance of video streaming, such as adaptive bit-rate streaming, multicast distribution, and edge computing are discussed and the necessity of low-latency and high-quality video transmission in cloud-based live streaming is underlined. Issues such as improving user experience and live streaming service performance using cutting-edge technology, like artificial intelligence and machine learning are discussed. In addition, the legal and regulatory implications of cloud-based live streaming, including issues with network neutrality, data privacy, and content moderation are addressed. The future of cloud computing for live streaming is examined in the section that follows, and it looks at the most likely new developments in terms of trends and technology. For technology vendors, live streaming service providers, and regulators, the findings have major policy-relevant implications. Suggestions on how stakeholders should address these concerns and take advantage of the potential presented by this rapidly evolving sector, as well as insights into the key challenges and opportunities associated with cloud-based live streaming are provided.</p>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"9 2","pages":"265-285"},"PeriodicalIF":5.1,"publicationDate":"2024-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12299","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140242982","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DPT-tracker: Dual pooling transformer for efficient visual tracking DPT-tracker:用于高效视觉跟踪的双集合变压器
IF 8.4 2区 计算机科学
CAAI Transactions on Intelligence Technology Pub Date : 2024-03-13 DOI: 10.1049/cit2.12296
Yang Fang, Bailian Xie, Uswah Khairuddin, Zijian Min, Bingbing Jiang, Weisheng Li
{"title":"DPT-tracker: Dual pooling transformer for efficient visual tracking","authors":"Yang Fang,&nbsp;Bailian Xie,&nbsp;Uswah Khairuddin,&nbsp;Zijian Min,&nbsp;Bingbing Jiang,&nbsp;Weisheng Li","doi":"10.1049/cit2.12296","DOIUrl":"10.1049/cit2.12296","url":null,"abstract":"<p>Transformer tracking always takes paired template and search images as encoder input and conduct feature extraction and target-search feature correlation by self and/or cross attention operations, thus the model complexity will grow quadratically with the number of input images. To alleviate the burden of this tracking paradigm and facilitate practical deployment of Transformer-based trackers, we propose a dual pooling transformer tracking framework, dubbed as DPT, which consists of three components: a simple yet efficient spatiotemporal attention model (SAM), a mutual correlation pooling Transformer (MCPT) and a multiscale aggregation pooling Transformer (MAPT). SAM is designed to gracefully aggregates temporal dynamics and spatial appearance information of multi-frame templates along space-time dimensions. MCPT aims to capture multi-scale pooled and correlated contextual features, which is followed by MAPT that aggregates multi-scale features into a unified feature representation for tracking prediction. DPT tracker achieves AUC score of 69.5 on LaSOT and precision score of 82.8 on TrackingNet while maintaining a shorter sequence length of attention tokens, fewer parameters and FLOPs compared to existing state-of-the-art (SOTA) Transformer tracking methods. Extensive experiments demonstrate that DPT tracker yields a strong real-time tracking baseline with a good trade-off between tracking performance and inference efficiency.</p>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"9 4","pages":"948-959"},"PeriodicalIF":8.4,"publicationDate":"2024-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12296","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140244948","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-granularity feature enhancement network for maritime ship detection 用于海上船舶探测的多粒度特征增强网络
IF 5.1 2区 计算机科学
CAAI Transactions on Intelligence Technology Pub Date : 2024-03-12 DOI: 10.1049/cit2.12310
Li Ying, Duoqian Miao, Zhifei Zhang, Hongyun Zhang, Witold Pedrycz
{"title":"Multi-granularity feature enhancement network for maritime ship detection","authors":"Li Ying,&nbsp;Duoqian Miao,&nbsp;Zhifei Zhang,&nbsp;Hongyun Zhang,&nbsp;Witold Pedrycz","doi":"10.1049/cit2.12310","DOIUrl":"10.1049/cit2.12310","url":null,"abstract":"<p>Due to the characteristics of high resolution and rich texture information, visible light images are widely used for maritime ship detection. However, these images are susceptible to sea fog and ships of different sizes, which can result in missed detections and false alarms, ultimately resulting in lower detection accuracy. To address these issues, a novel multi-granularity feature enhancement network, MFENet, which includes a three-way dehazing module (3WDM) and a multi-granularity feature enhancement module (MFEM) is proposed. The 3WDM eliminates sea fog interference by using an image clarity automatic classification algorithm based on three-way decisions and FFA-Net to obtain clear image samples. Additionally, the MFEM improves the accuracy of detecting ships of different sizes by utilising an improved super-resolution reconstruction convolutional neural network to enhance the resolution and semantic representation capability of the feature maps from YOLOv7. Experimental results demonstrate that MFENet surpasses the other 15 competing models in terms of the mean Average Precision metric on two benchmark datasets, achieving 96.28% on the McShips dataset and 97.71% on the SeaShips dataset.</p>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"9 3","pages":"649-664"},"PeriodicalIF":5.1,"publicationDate":"2024-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12310","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140249217","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An intelligent prediction model of epidemic characters based on multi-feature 基于多特征的流行病特征智能预测模型
IF 5.1 2区 计算机科学
CAAI Transactions on Intelligence Technology Pub Date : 2024-03-05 DOI: 10.1049/cit2.12294
Xiaoying Wang, Chunmei Li, Yilei Wang, Lin Yin, Qilin Zhou, Rui Zheng, Qingwu Wu, Yuqi Zhou, Min Dai
{"title":"An intelligent prediction model of epidemic characters based on multi-feature","authors":"Xiaoying Wang,&nbsp;Chunmei Li,&nbsp;Yilei Wang,&nbsp;Lin Yin,&nbsp;Qilin Zhou,&nbsp;Rui Zheng,&nbsp;Qingwu Wu,&nbsp;Yuqi Zhou,&nbsp;Min Dai","doi":"10.1049/cit2.12294","DOIUrl":"10.1049/cit2.12294","url":null,"abstract":"<p>The epidemic characters of Omicron (<i>e</i>.<i>g</i>. large-scale transmission) are significantly different from the initial variants of COVID-19. The data generated by large-scale transmission is important to predict the trend of epidemic characters. However, the results of current prediction models are inaccurate since they are not closely combined with the actual situation of Omicron transmission. In consequence, these inaccurate results have negative impacts on the process of the manufacturing and the service industry, for example, the production of masks and the recovery of the tourism industry. The authors have studied the epidemic characters in two ways, that is, investigation and prediction. First, a large amount of data is collected by utilising the Baidu index and conduct questionnaire survey concerning epidemic characters. Second, the <i>β</i>-SEIDR model is established, where the population is classified as Susceptible, Exposed, Infected, Dead and <i>β</i>-Recovered persons, to intelligently predict the epidemic characters of COVID-19. Note that <i>β</i>-Recovered persons denote that the Recovered persons may become Susceptible persons with probability <i>β</i>. The simulation results show that the model can accurately predict the epidemic characters.</p>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"9 3","pages":"595-607"},"PeriodicalIF":5.1,"publicationDate":"2024-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12294","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140263286","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel myocarditis detection combining deep reinforcement learning and an improved differential evolution algorithm 结合深度强化学习和改进的差分进化算法的新型心肌炎检测方法
IF 8.4 2区 计算机科学
CAAI Transactions on Intelligence Technology Pub Date : 2024-02-15 DOI: 10.1049/cit2.12289
Jing Yang, Touseef Sadiq, Jiale Xiong, Muhammad Awais, Uzair Aslam Bhatti, Roohallah Alizadehsani, Juan Manuel Gorriz
{"title":"A novel myocarditis detection combining deep reinforcement learning and an improved differential evolution algorithm","authors":"Jing Yang,&nbsp;Touseef Sadiq,&nbsp;Jiale Xiong,&nbsp;Muhammad Awais,&nbsp;Uzair Aslam Bhatti,&nbsp;Roohallah Alizadehsani,&nbsp;Juan Manuel Gorriz","doi":"10.1049/cit2.12289","DOIUrl":"10.1049/cit2.12289","url":null,"abstract":"<p>Myocarditis is a serious cardiovascular ailment that can lead to severe consequences if not promptly treated. It is triggered by viral infections and presents symptoms such as chest pain and heart dysfunction. Early detection is crucial for successful treatment, and cardiac magnetic resonance imaging (CMR) is a valuable tool for identifying this condition. However, the detection of myocarditis using CMR images can be challenging due to low contrast, variable noise, and the presence of multiple high CMR slices per patient. To overcome these challenges, the approach proposed incorporates advanced techniques such as convolutional neural networks (CNNs), an improved differential evolution (DE) algorithm for pre-training, and a reinforcement learning (RL)-based model for training. Developing this method presented a significant challenge due to the imbalanced classification of the Z-Alizadeh Sani myocarditis dataset from Omid Hospital in Tehran. To address this, the training process is framed as a sequential decision-making process, where the agent receives higher rewards/penalties for correctly/incorrectly classifying the minority/majority class. Additionally, the authors suggest an enhanced DE algorithm to initiate the backpropagation (BP) process, overcoming the initialisation sensitivity issue of gradient-based methods like back-propagation during the training phase. The effectiveness of the proposed model in diagnosing myocarditis is demonstrated through experimental results based on standard performance metrics. Overall, this method shows promise in expediting the triage of CMR images for automatic screening, facilitating early detection and successful treatment of myocarditis.</p>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"9 6","pages":"1347-1360"},"PeriodicalIF":8.4,"publicationDate":"2024-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12289","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139963010","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel medical image data protection scheme for smart healthcare system 用于智能医疗系统的新型医疗图像数据保护方案
IF 8.4 2区 计算机科学
CAAI Transactions on Intelligence Technology Pub Date : 2024-02-13 DOI: 10.1049/cit2.12292
Mujeeb Ur Rehman, Arslan Shafique, Muhammad Shahbaz Khan, Maha Driss, Wadii Boulila, Yazeed Yasin Ghadi, Suresh Babu Changalasetty, Majed Alhaisoni, Jawad Ahmad
{"title":"A novel medical image data protection scheme for smart healthcare system","authors":"Mujeeb Ur Rehman,&nbsp;Arslan Shafique,&nbsp;Muhammad Shahbaz Khan,&nbsp;Maha Driss,&nbsp;Wadii Boulila,&nbsp;Yazeed Yasin Ghadi,&nbsp;Suresh Babu Changalasetty,&nbsp;Majed Alhaisoni,&nbsp;Jawad Ahmad","doi":"10.1049/cit2.12292","DOIUrl":"10.1049/cit2.12292","url":null,"abstract":"<p>The Internet of Multimedia Things (IoMT) refers to a network of interconnected multimedia devices that communicate with each other over the Internet. Recently, smart healthcare has emerged as a significant application of the IoMT, particularly in the context of knowledge-based learning systems. Smart healthcare systems leverage knowledge-based learning to become more context-aware, adaptable, and auditable while maintaining the ability to learn from historical data. In smart healthcare systems, devices capture images, such as X-rays, Magnetic Resonance Imaging. The security and integrity of these images are crucial for the databases used in knowledge-based learning systems to foster structured decision-making and enhance the learning abilities of AI. Moreover, in knowledge-driven systems, the storage and transmission of HD medical images exert a burden on the limited bandwidth of the communication channel, leading to data transmission delays. To address the security and latency concerns, this paper presents a lightweight medical image encryption scheme utilising bit-plane decomposition and chaos theory. The results of the experiment yield entropy, energy, and correlation values of 7.999, 0.0156, and 0.0001, respectively. This validates the effectiveness of the encryption system proposed in this paper, which offers high-quality encryption, a large key space, key sensitivity, and resistance to statistical attacks.</p>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"9 4","pages":"821-836"},"PeriodicalIF":8.4,"publicationDate":"2024-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12292","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139841677","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Knowledge-based deep learning system for classifying Alzheimer's disease for multi-task learning 基于知识的深度学习系统,用于对阿尔茨海默病进行多任务学习分类
IF 8.4 2区 计算机科学
CAAI Transactions on Intelligence Technology Pub Date : 2024-02-08 DOI: 10.1049/cit2.12291
Amol Dattatray Dhaygude, Gaurav Kumar Ameta, Ihtiram Raza Khan, Pavitar Parkash Singh, Renato R. Maaliw III, Natrayan Lakshmaiya, Mohammad Shabaz, Muhammad Attique Khan, Hany S. Hussein, Hammam Alshazly
{"title":"Knowledge-based deep learning system for classifying Alzheimer's disease for multi-task learning","authors":"Amol Dattatray Dhaygude,&nbsp;Gaurav Kumar Ameta,&nbsp;Ihtiram Raza Khan,&nbsp;Pavitar Parkash Singh,&nbsp;Renato R. Maaliw III,&nbsp;Natrayan Lakshmaiya,&nbsp;Mohammad Shabaz,&nbsp;Muhammad Attique Khan,&nbsp;Hany S. Hussein,&nbsp;Hammam Alshazly","doi":"10.1049/cit2.12291","DOIUrl":"10.1049/cit2.12291","url":null,"abstract":"<p>Deep learning has recently become a viable approach for classifying Alzheimer's disease (AD) in medical imaging. However, existing models struggle to efficiently extract features from medical images and may squander additional information resources for illness classification. To address these issues, a deep three-dimensional convolutional neural network incorporating multi-task learning and attention mechanisms is proposed. An upgraded primary C3D network is utilised to create rougher low-level feature maps. It introduces a new convolution block that focuses on the structural aspects of the magnetic resonance imaging image and another block that extracts attention weights unique to certain pixel positions in the feature map and multiplies them with the feature map output. Then, several fully connected layers are used to achieve multi-task learning, generating three outputs, including the primary classification task. The other two outputs employ backpropagation during training to improve the primary classification job. Experimental findings show that the authors’ proposed method outperforms current approaches for classifying AD, achieving enhanced classification accuracy and other indicators on the Alzheimer's disease Neuroimaging Initiative dataset. The authors demonstrate promise for future disease classification studies.</p>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"9 4","pages":"805-820"},"PeriodicalIF":8.4,"publicationDate":"2024-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12291","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139792149","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hyperspectral image super resolution using deep internal and self-supervised learning 利用深度内部学习和自我监督学习实现高光谱图像超分辨率
IF 5.1 2区 计算机科学
CAAI Transactions on Intelligence Technology Pub Date : 2024-02-01 DOI: 10.1049/cit2.12285
Zhe Liu, Xian-Hua Han
{"title":"Hyperspectral image super resolution using deep internal and self-supervised learning","authors":"Zhe Liu,&nbsp;Xian-Hua Han","doi":"10.1049/cit2.12285","DOIUrl":"https://doi.org/10.1049/cit2.12285","url":null,"abstract":"<p>By automatically learning the priors embedded in images with powerful modelling capabilities, deep learning-based algorithms have recently made considerable progress in reconstructing the high-resolution hyperspectral (HR-HS) image. With previously collected large-amount of external data, these methods are intuitively realised under the full supervision of the ground-truth data. Thus, the database construction in merging the low-resolution (LR) HS (LR-HS) and HR multispectral (MS) or RGB image research paradigm, commonly named as HSI SR, requires collecting corresponding training triplets: HR-MS (RGB), LR-HS and HR-HS image simultaneously, and often faces difficulties in reality. The learned models with the training datasets collected simultaneously under controlled conditions may significantly degrade the HSI super-resolved performance to the real images captured under diverse environments. To handle the above-mentioned limitations, the authors propose to leverage the deep internal and self-supervised learning to solve the HSI SR problem. The authors advocate that it is possible to train a specific CNN model at test time, called as deep internal learning (DIL), by on-line preparing the training triplet samples from the observed LR-HS/HR-MS (or RGB) images and the down-sampled LR-HS version. However, the number of the training triplets extracted solely from the transformed data of the observation itself is extremely few particularly for the HSI SR tasks with large spatial upscale factors, which would result in limited reconstruction performance. To solve this problem, the authors further exploit deep self-supervised learning (DSL) by considering the observations as the unlabelled training samples. Specifically, the degradation modules inside the network were elaborated to realise the spatial and spectral down-sampling procedures for transforming the generated HR-HS estimation to the high-resolution RGB/LR-HS approximation, and then the reconstruction errors of the observations were formulated for measuring the network modelling performance. By consolidating the DIL and DSL into a unified deep framework, the authors construct a more robust HSI SR method without any prior training and have great potential of flexible adaptation to different settings per observation. To verify the effectiveness of the proposed approach, extensive experiments have been conducted on two benchmark HS datasets, including the CAVE and Harvard datasets, and demonstrate the great performance gain of the proposed method over the state-of-the-art methods.</p>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"9 1","pages":"128-141"},"PeriodicalIF":5.1,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.12285","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139732276","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信