POPAR: Patch Order Prediction and Appearance Recovery for self-supervised learning in chest radiography

IF 11.8 1区医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Medical image analysis Pub Date : 2025-07-19 DOI:10.1016/j.media.2025.103720

Jiaxuan Pang , Dongao Ma , Ziyu Zhou , Michael B. Gotway , Jianming Liang

{"title":"POPAR: Patch Order Prediction and Appearance Recovery for self-supervised learning in chest radiography","authors":"Jiaxuan Pang , Dongao Ma , Ziyu Zhou , Michael B. Gotway , Jianming Liang","doi":"10.1016/j.media.2025.103720","DOIUrl":null,"url":null,"abstract":"<div><div>Self-supervised learning (SSL) has proven effective in reducing the dependency on large annotated datasets while achieving state-of-the-art (SoTA) performance in computer vision. However, its adoption in medical imaging remains slow due to fundamental differences between photographic and medical images. To address this, we propose POPAR (Patch Order Prediction and Appearance Recovery), a novel SSL framework tailored for medical image analysis, particularly chest X-ray interpretation. POPAR introduces two key learning strategies: (1) Patch order prediction, which helps the model learn anatomical structures and spatial relationships by predicting the arrangement of shuffled patches, and (2) Patch appearance recovery, which reconstructs fine-grained details to enhance texture-based feature learning. Using a Swin Transformer backbone, POPAR is pretrained on a large-scale dataset and extensively evaluated across multiple tasks, outperforming both SSL and fully supervised SoTA models in classification, segmentation, anatomical understanding, bias robustness, and data efficiency. Our findings highlight POPAR’s scalability, strong generalization, and effectiveness in medical imaging applications. All code and models are available at <span><span>GitHub.com/JLiangLab/POPAR</span><svg><path></path></svg></span> (Version 2).</div></div>","PeriodicalId":18328,"journal":{"name":"Medical image analysis","volume":"105 ","pages":"Article 103720"},"PeriodicalIF":11.8000,"publicationDate":"2025-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical image analysis","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1361841525002671","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Self-supervised learning (SSL) has proven effective in reducing the dependency on large annotated datasets while achieving state-of-the-art (SoTA) performance in computer vision. However, its adoption in medical imaging remains slow due to fundamental differences between photographic and medical images. To address this, we propose POPAR (Patch Order Prediction and Appearance Recovery), a novel SSL framework tailored for medical image analysis, particularly chest X-ray interpretation. POPAR introduces two key learning strategies: (1) Patch order prediction, which helps the model learn anatomical structures and spatial relationships by predicting the arrangement of shuffled patches, and (2) Patch appearance recovery, which reconstructs fine-grained details to enhance texture-based feature learning. Using a Swin Transformer backbone, POPAR is pretrained on a large-scale dataset and extensively evaluated across multiple tasks, outperforming both SSL and fully supervised SoTA models in classification, segmentation, anatomical understanding, bias robustness, and data efficiency. Our findings highlight POPAR’s scalability, strong generalization, and effectiveness in medical imaging applications. All code and models are available at GitHub.com/JLiangLab/POPAR (Version 2).

Abstract Image

查看原文本刊更多论文

胸片自监督学习的贴片顺序预测和外观恢复

自监督学习（SSL）已被证明可以有效地减少对大型注释数据集的依赖，同时在计算机视觉中实现最先进的（SoTA）性能。然而，由于摄影和医学图像之间的根本差异，它在医学成像中的应用仍然缓慢。为了解决这个问题，我们提出了POPAR（补丁顺序预测和外观恢复），这是一种为医学图像分析，特别是胸部x射线解释量身定制的新型SSL框架。POPAR引入了两个关键的学习策略：(1)斑块顺序预测，通过预测洗牌斑块的排列，帮助模型学习解剖结构和空间关系；(2)斑块外观恢复，重建细粒度细节，增强基于纹理的特征学习。使用Swin Transformer主干，POPAR在大规模数据集上进行预训练，并在多个任务中进行广泛评估，在分类、分割、解剖理解、偏差鲁棒性和数据效率方面优于SSL和完全监督的SoTA模型。我们的发现突出了POPAR在医学成像应用中的可扩展性、强通用性和有效性。所有代码和模型都可以在GitHub.com/JLiangLab/POPAR （Version 2）上获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Medical image analysis 工程技术-工程：生物医学

CiteScore

22.10

自引率

6.40%

发文量

309

审稿时长

6.6 months

期刊介绍： Medical Image Analysis serves as a platform for sharing new research findings in the realm of medical and biological image analysis, with a focus on applications of computer vision, virtual reality, and robotics to biomedical imaging challenges. The journal prioritizes the publication of high-quality, original papers contributing to the fundamental science of processing, analyzing, and utilizing medical and biological images. It welcomes approaches utilizing biomedical image datasets across all spatial scales, from molecular/cellular imaging to tissue/organ imaging.