Tsinghua Science and Technology最新文献

筛选
英文 中文
Advanced Deep Neural Network with Unified Feature-Aware and Label Embedding for Multi-Label Arrhythmias Classification 基于统一特征感知和标签嵌入的高级深度神经网络多标签心律失常分类
IF 6.6 1区 计算机科学
Tsinghua Science and Technology Pub Date : 2024-12-30 DOI: 10.26599/TST.2023.9010162
Pan Xia;Zhongrui Bai;Yicheng Yao;Lirui Xu;Hao Zhang;Lidong Du;Xianxiang Chen;Qiao Ye;Yusi Zhu;Peng Wang;Xiaoran Li;Guangyun Wang;Zhen Fang
{"title":"Advanced Deep Neural Network with Unified Feature-Aware and Label Embedding for Multi-Label Arrhythmias Classification","authors":"Pan Xia;Zhongrui Bai;Yicheng Yao;Lirui Xu;Hao Zhang;Lidong Du;Xianxiang Chen;Qiao Ye;Yusi Zhu;Peng Wang;Xiaoran Li;Guangyun Wang;Zhen Fang","doi":"10.26599/TST.2023.9010162","DOIUrl":"https://doi.org/10.26599/TST.2023.9010162","url":null,"abstract":"Multi-label arrhythmias classification is of great significance to the diagnosis of cardiovascular disease, and it is a challenging task as it requires identifying the label subset most related to each instance. In this paper, by integrating a deep residual neural network and auto-encoder, we propose an advanced deep neural network (DNN) framework with unified feature-aware and label embedding to perform multi-label arrhythmias classification involving 30 types of arrhythmias. Firstly, a deep residual neural network is built to extract the complex pathological features from varying-dimensional electrocardiograms (ECGs). Secondly, the mean square error loss is adopted to learn a latent space associating the deep pathological features and the corresponding label data, and then to achieve unified feature-label embedding. Thirdly, the label-correlation aware loss is introduced to optimize the auto-encoder architecture, which enables our model to exploit label-correlation for improved multi-label prediction. Our proposed DNN model can allow end-to-end training and prediction, which can perform feature-aware, label embedding, and label-correlation aware prediction in a unified framework. Finally, our proposed model is evaluated on the currently largest public dataset worldwide, and achieves the challenge metric scores of 0.492, 0.495, and 0.490 on the 12-lead, 3-lead, and all-lead version ECGs, respectively. The performance of our approach outperforms other current state-of-the-art methods in the leave-one-dataset-out cross-validation setting, which demonstrates that our approach has great competitiveness in identifying a wider range of multi-label arrhythmias.","PeriodicalId":48690,"journal":{"name":"Tsinghua Science and Technology","volume":"30 3","pages":"1251-1269"},"PeriodicalIF":6.6,"publicationDate":"2024-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10817700","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142905810","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AMTrans: Auto-Correlation Multi-Head Attention Transformer for Infrared Spectral Deconvolution 红外光谱反褶积自相关多头注意转换器
IF 6.6 1区 计算机科学
Tsinghua Science and Technology Pub Date : 2024-12-30 DOI: 10.26599/TST.2024.9010131
Lei Gao;Liyuan Cui;Shuwen Chen;Lizhen Deng;Xiaokang Wang;Xiaohong Yan;Hu Zhu
{"title":"AMTrans: Auto-Correlation Multi-Head Attention Transformer for Infrared Spectral Deconvolution","authors":"Lei Gao;Liyuan Cui;Shuwen Chen;Lizhen Deng;Xiaokang Wang;Xiaohong Yan;Hu Zhu","doi":"10.26599/TST.2024.9010131","DOIUrl":"https://doi.org/10.26599/TST.2024.9010131","url":null,"abstract":"Infrared spectroscopy analysis has found widespread applications in various fields due to advancements in technology and industry convergence. To improve the quality and reliability of infrared spectroscopy signals, deconvolution is a crucial preprocessing step. Inspired by the transformer model, we propose an Auto-correlation Multi-head attention Transformer (AMTrans) for infrared spectrum sequence deconvolution. The auto-correlation attention model improves the scaled dot-product attention in the transformer. It utilizes attention mechanism for feature extraction and implements attention computation using the auto-correlation function. The auto-correlation attention model is used to exploit the inherent sequence nature of spectral data and to effectively recovery spectra by capturing auto-correlation patterns in the sequence. The proposed model is trained using supervised learning and demonstrates promising results in infrared spectroscopic restoration. By comparing the experiments with other deconvolution techniques, the experimental results show that the method has excellent deconvolution performance and can effectively recover the texture details of the infrared spectrum.","PeriodicalId":48690,"journal":{"name":"Tsinghua Science and Technology","volume":"30 3","pages":"1329-1341"},"PeriodicalIF":6.6,"publicationDate":"2024-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10817762","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142905911","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Fine-Grained Image Classification Model Based on Hybrid Attention and Pyramidal Convolution 基于混合注意力和金字塔卷积的细粒度图像分类模型
IF 6.6 1区 计算机科学
Tsinghua Science and Technology Pub Date : 2024-12-30 DOI: 10.26599/TST.2024.9010025
Sifeng Wang;Shengxiang Li;Anran Li;Zhaoan Dong;Guangshun Li;Chao Yan
{"title":"A Fine-Grained Image Classification Model Based on Hybrid Attention and Pyramidal Convolution","authors":"Sifeng Wang;Shengxiang Li;Anran Li;Zhaoan Dong;Guangshun Li;Chao Yan","doi":"10.26599/TST.2024.9010025","DOIUrl":"https://doi.org/10.26599/TST.2024.9010025","url":null,"abstract":"Finding more specific subcategories within a larger category is the goal of fine-grained image classification (FGIC), and the key is to find local discriminative regions of visual features. Most existing methods use traditional convolutional operations to achieve fine-grained image classification. However, traditional convolution cannot extract multi-scale features of an image and existing methods are susceptible to interference from image background information. Therefore, to address the above problems, this paper proposes an FGIC model (Attention-PCNN) based on hybrid attention mechanism and pyramidal convolution. The model feeds the multi-scale features extracted by the pyramidal convolutional neural network into two branches capturing global and local information respectively. In particular, a hybrid attention mechanism is added to the branch capturing global information in order to reduce the interference of image background information and make the model pay more attention to the target region with fine-grained features. In addition, the mutual-channel loss (MC-LOSS) is introduced in the local information branch to capture fine-grained features. We evaluated the model on three publicly available datasets CUB-200-2011, Stanford Cars, FGVC-Aircraft, etc. Compared to the state-of-the-art methods, the results show that Attention-PCNN performs better.","PeriodicalId":48690,"journal":{"name":"Tsinghua Science and Technology","volume":"30 3","pages":"1283-1293"},"PeriodicalIF":6.6,"publicationDate":"2024-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10817763","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142905727","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adversarial Attack on Object Detection via Object Feature-Wise Attention and Perturbation Extraction 基于目标特征关注和扰动提取的目标检测对抗攻击
IF 6.6 1区 计算机科学
Tsinghua Science and Technology Pub Date : 2024-12-30 DOI: 10.26599/TST.2024.9010029
Wei Xue;Xiaoyan Xia;Pengcheng Wan;Ping Zhong;Xiao Zheng
{"title":"Adversarial Attack on Object Detection via Object Feature-Wise Attention and Perturbation Extraction","authors":"Wei Xue;Xiaoyan Xia;Pengcheng Wan;Ping Zhong;Xiao Zheng","doi":"10.26599/TST.2024.9010029","DOIUrl":"https://doi.org/10.26599/TST.2024.9010029","url":null,"abstract":"Deep neural networks are commonly used in computer vision tasks, but they are vulnerable to adversarial samples, resulting in poor recognition accuracy. Although traditional algorithms that craft adversarial samples have been effective in attacking classification models, the attacking performance degrades when facing object detection models with more complex structures. To address this issue better, in this paper we first analyze the mechanism of multi-scale feature extraction of object detection models, and then by constructing the object feature-wise attention module and the perturbation extraction module, a novel adversarial sample generation algorithm for attacking detection models is proposed. Specifically, in the first module, based on the multi-scale feature map, we reduce the range of perturbation and improve the stealthiness of adversarial samples by computing the noise distribution in the object region. Then in the second module, we feed the noise distribution into the generative adversarial networks to generate adversarial perturbation with strong attack transferability. By doing so, the proposed approach possesses the ability to better confuse the judgment of detection models. Experiments carried out on the DroneVehicle dataset show that our method is computationally efficient and works well in attacking detection models measured by qualitative analysis and quantitative analysis.","PeriodicalId":48690,"journal":{"name":"Tsinghua Science and Technology","volume":"30 3","pages":"1174-1189"},"PeriodicalIF":6.6,"publicationDate":"2024-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10817718","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142905809","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sphere Decoding for Binary Polar Codes with the Modified Multiplicative Repetition Construction 基于改进的乘式重复结构的二元极码球面译码
IF 6.6 1区 计算机科学
Tsinghua Science and Technology Pub Date : 2024-12-30 DOI: 10.26599/TST.2024.9010030
Haiqiang Chen;Yuanbo Liu;Shuping Dang;Qingnian Li;Youming Sun;Xiangcheng Li
{"title":"Sphere Decoding for Binary Polar Codes with the Modified Multiplicative Repetition Construction","authors":"Haiqiang Chen;Yuanbo Liu;Shuping Dang;Qingnian Li;Youming Sun;Xiangcheng Li","doi":"10.26599/TST.2024.9010030","DOIUrl":"https://doi.org/10.26599/TST.2024.9010030","url":null,"abstract":"Compared to the successive cancellation (SC)-based decoding algorithms, the sphere decoding (SD) algorithm can achieve better performance with reduced computational complexity, especially for short polar codes. In this paper, we propose a new method to construct the binary polar codes with the modified multiplicative repetition (MR)-based matrix. Different from the original construction, we first design a \u0000<tex>$2times 2 qtext{-ary}$</tex>\u0000 kernel to guarantee the single-level polarization effect. Then, by replacing the new-designed binary companion matrix, a novel strategy is further developed to enhance the polarization in the bit level, resulting in a better distance property. Finally, the SD-based Monte-Carlo (SDMC) method is used to construct MR-based binary polar codes, while the resulting codes without the butterfly pattern are decoded by the SD algorithm. Simulation results show that the proposed method with the SD algorithm can achieve a maximum performance gain of 0.27 dB compared to the original method with slightly lower complexity.","PeriodicalId":48690,"journal":{"name":"Tsinghua Science and Technology","volume":"30 3","pages":"1229-1236"},"PeriodicalIF":6.6,"publicationDate":"2024-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10817720","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142905814","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep Learning Based Side-Channel Attack Detection for Mobile Devices Security in 5G Networks 基于深度学习的5G移动设备侧信道攻击安全检测
IF 6.6 1区 计算机科学
Tsinghua Science and Technology Pub Date : 2024-12-30 DOI: 10.26599/TST.2024.9010123
Amjed A. Ahmed;Mohammad Kamrul Hasan;Ali Alqahtani;Shayla Islam;Bishwajeet Pandey;Leila Rzayeva;Huda Saleh Abbas;Azana Hafizah Mohd Aman;Nayef Alqahtani
{"title":"Deep Learning Based Side-Channel Attack Detection for Mobile Devices Security in 5G Networks","authors":"Amjed A. Ahmed;Mohammad Kamrul Hasan;Ali Alqahtani;Shayla Islam;Bishwajeet Pandey;Leila Rzayeva;Huda Saleh Abbas;Azana Hafizah Mohd Aman;Nayef Alqahtani","doi":"10.26599/TST.2024.9010123","DOIUrl":"https://doi.org/10.26599/TST.2024.9010123","url":null,"abstract":"Mobile devices within Fifth Generation (5G) networks, typically equipped with Android systems, serve as a bridge to connect digital gadgets such as global positioning system, mobile devices, and wireless routers, which are vital in facilitating end-user communication requirements. However, the security of Android systems has been challenged by the sensitive data involved, leading to vulnerabilities in mobile devices used in 5G networks. These vulnerabilities expose mobile devices to cyber-attacks, primarily resulting from security gaps. Zero-permission apps in Android can exploit these channels to access sensitive information, including user identities, login credentials, and geolocation data. One such attack leverages “zero-permission” sensors like accelerometers and gyroscopes, enabling attackers to gather information about the smartphone's user. This underscores the importance of fortifying mobile devices against potential future attacks. Our research focuses on a new recurrent neural network prediction model, which has proved highly effective for detecting side-channel attacks in mobile devices in 5G networks. We conducted state-of-the-art comparative studies to validate our experimental approach. The results demonstrate that even a small amount of training data can accurately recognize 37.5% of previously unseen user-typed words. Moreover, our tap detection mechanism achieves a 92% accuracy rate, a crucial factor for text inference. These findings have significant practical implications, as they reinforce mobile device security in 5G networks, enhancing user privacy, and data protection.","PeriodicalId":48690,"journal":{"name":"Tsinghua Science and Technology","volume":"30 3","pages":"1012-1026"},"PeriodicalIF":6.6,"publicationDate":"2024-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10817772","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142905844","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptive Dwell Scheduling Based on Dual-Side Time Pointers for Simultaneous Multi-Beam Radar 基于双面时间指针的多波束雷达自适应驻留调度
IF 6.6 1区 计算机科学
Tsinghua Science and Technology Pub Date : 2024-12-30 DOI: 10.26599/TST.2023.9010161
Siyu Heng;Ting Cheng;Jiaming Song;Zishu He;Luqing Liu;Yuanqing Wang
{"title":"Adaptive Dwell Scheduling Based on Dual-Side Time Pointers for Simultaneous Multi-Beam Radar","authors":"Siyu Heng;Ting Cheng;Jiaming Song;Zishu He;Luqing Liu;Yuanqing Wang","doi":"10.26599/TST.2023.9010161","DOIUrl":"https://doi.org/10.26599/TST.2023.9010161","url":null,"abstract":"Adaptive dwell scheduling is essential to achieve full performance for a simultaneous multi-beam radar system. The dwell scheduling for such a radar system considering desired execution time criterion is studied in this paper. The primary objective of this model is to achieve maximum scheduling gain and minimum scheduling cost while adhering to not only time, aperture, and frequency constraints, but also electromagnetic compatibility (EMC) constraint. The dwell scheduling algorithm is proposed to solve the above optimization problem, where several separation points are set on the timeline, so that each separator divides the scheduling interval into two sides. For the two sides, the dual-side time pointers are introduced, which move from the separator to both ends of the scheduling interval. The dwell tasks are analyzed in sequence at each analysis point based on their two-level synthetical priority. These tasks are then executed simultaneously by sharing the whole aperture under various constraints to accomplish multiple tasks concurrently. The above process is respectively conducted at each separator, and the final scheduling result is the one with the minimal cost among all. Simulation results prove that the proposed algorithm can achieve real-time dwell scheduling and outperform the existing algorithms in terms of scheduling performance.","PeriodicalId":48690,"journal":{"name":"Tsinghua Science and Technology","volume":"30 3","pages":"1190-1200"},"PeriodicalIF":6.6,"publicationDate":"2024-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10817723","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142905812","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Integrated Blockchain Framework for Secure Data Sharing in IoT Fog Computing 物联网雾计算中安全数据共享的集成区块链框架
IF 6.6 1区 计算机科学
Tsinghua Science and Technology Pub Date : 2024-12-30 DOI: 10.26599/TST.2024.9010082
Peda Narayana Bathula;M. Sreenivasulu
{"title":"An Integrated Blockchain Framework for Secure Data Sharing in IoT Fog Computing","authors":"Peda Narayana Bathula;M. Sreenivasulu","doi":"10.26599/TST.2024.9010082","DOIUrl":"https://doi.org/10.26599/TST.2024.9010082","url":null,"abstract":"The importance of secure data sharing in fog computing is increasing due to the growing number of Internet of Things (IoT) devices. This article addresses the privacy and security issues brought up by data sharing in the context of IoT fog computing. The suggested framework, called “BlocFogSec”, secures key management and data sharing through blockchain consensus and smart contracts. Unlike existing solutions, BlocFogSec utilizes two types of smart contracts for secure key exchange and data sharing, while employing a consensus protocol to validate transactions and maintain blockchain integrity. To process and store data effectively at the network edge, the framework makes use of fog computing, notably reducing latency and raising throughput. BlocFogSec successfully blocks unauthorized access and data breaches by restricting transactions to authorized nodes. In addition, the framework uses a consensus protocol to validate and add transactions to the blockchain, guaranteeing data accuracy and immutability. To compare BlocFogSec's performance to that of other models, a number of simulations are conducted. The simulation results indicate that BlocFogSec consistently outperforms existing models, such as Security Services for Fog Computing (SSFC) and Blockchain-based Key Management Scheme (BKMS), in terms of throughput (up to 5135 bytes per second), latency (as low as 7 ms), and resource utilization (70% to 92%). The evaluation also takes into account attack defending accuracy (up to 100%), precision (up to 100%), and recall (up to 99.6%), demonstrating BlocFogSec's effectiveness in identifying and preventing potential attacks.","PeriodicalId":48690,"journal":{"name":"Tsinghua Science and Technology","volume":"30 3","pages":"957-977"},"PeriodicalIF":6.6,"publicationDate":"2024-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10817699","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142905887","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Efficient Quantum Enabled Machine Algorithm by Universal Features for Predicting Botnet Attacks in Digital Twin Enabled IoT Networks 基于通用特征的高效量子机器算法用于预测数字孪生物联网中僵尸网络攻击
IF 6.6 1区 计算机科学
Tsinghua Science and Technology Pub Date : 2024-12-30 DOI: 10.26599/TST.2024.9010052
Katta Rajesh Babu;Naramula Venkatesh;K. Shashidhar;Yellampalli Dasaratha Rami Reddy;K. Naga Prakash
{"title":"An Efficient Quantum Enabled Machine Algorithm by Universal Features for Predicting Botnet Attacks in Digital Twin Enabled IoT Networks","authors":"Katta Rajesh Babu;Naramula Venkatesh;K. Shashidhar;Yellampalli Dasaratha Rami Reddy;K. Naga Prakash","doi":"10.26599/TST.2024.9010052","DOIUrl":"https://doi.org/10.26599/TST.2024.9010052","url":null,"abstract":"In this manuscript, the authors introduce a quantum enabled Reinforcement Algorithm by Universal Features (REMF) as a lightweight solution designed to identify and assess the impact of botnet attacks on 5G Internet of Things (IoT) networks. REMF's primary objective is the swift detection of botnet assaults and their effects, aiming to prevent the initiation of such attacks. The algorithm introduces a novel adaptive classification boosting through reinforcement learning, training on values derived from universal features extracted from network transactions within a given training corpus. During the prediction phase, REMF assesses the Botnet attack confidence of feature values obtained from unlabeled network transactions. It then compares these botnet attack confidence values with the botnet attack confidence of optimal features derived during the training phase to predict the potential impact of the botnet attack, categorizing it as high, moderate, low, or not-an-attack (normal). The performance evaluation results demonstrate that REMF achieves the highest decision accuracy, displaying maximum sensitivity and specificity in predicting the scope of botnet attacks at an early stage. The experimental study illustrates that REMF outperforms existing detection techniques for predicting botnet attacks.","PeriodicalId":48690,"journal":{"name":"Tsinghua Science and Technology","volume":"30 3","pages":"947-956"},"PeriodicalIF":6.6,"publicationDate":"2024-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10817765","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142905888","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Few-Shot Object Detection via Dual-Domain Feature Fusion and Patch-Level Attention 基于双域特征融合和补丁级关注的小镜头目标检测
IF 6.6 1区 计算机科学
Tsinghua Science and Technology Pub Date : 2024-12-30 DOI: 10.26599/TST.2024.9010031
Guangli Ren;Jierui Liu;Mengyao Wang;Peiyu Guan;Zhiqiang Cao;Junzhi Yu
{"title":"Few-Shot Object Detection via Dual-Domain Feature Fusion and Patch-Level Attention","authors":"Guangli Ren;Jierui Liu;Mengyao Wang;Peiyu Guan;Zhiqiang Cao;Junzhi Yu","doi":"10.26599/TST.2024.9010031","DOIUrl":"https://doi.org/10.26599/TST.2024.9010031","url":null,"abstract":"Few-shot object detection receives much attention with the ability to detect novel class objects using limited annotated data. The transfer learning-based solution becomes popular due to its simple training with good accuracy, however, it is still challenging to enrich the feature diversity during the training process. And fine-grained features are also insufficient for novel class detection. To deal with the problems, this paper proposes a novel few-shot object detection method based on dual-domain feature fusion and patch-level attention. Upon original base domain, an elementary domain with more category-agnostic features is superposed to construct a two-stream backbone, which benefits to enrich the feature diversity. To better integrate various features, a dual-domain feature fusion is designed, where the feature pairs with the same size are complementarily fused to extract more discriminative features. Moreover, a patch-wise feature refinement termed as patch-level attention is presented to mine internal relations among the patches, which enhances the adaptability to novel classes. In addition, a weighted classification loss is given to assist the fine-tuning of the classifier by combining extra features from FPN of the base training model. In this way, the few-shot detection quality to novel class objects is improved. Experiments on PASCAL VOC and MS COCO datasets verify the effectiveness of the method.","PeriodicalId":48690,"journal":{"name":"Tsinghua Science and Technology","volume":"30 3","pages":"1237-1250"},"PeriodicalIF":6.6,"publicationDate":"2024-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10817768","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142905733","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信