Andreas Zamanos, Panagiotis Koromilas, Giorgos Bouritsas, Panagiotis L Kastritis, Yannis Panagakis
Self-supervised learning for generalizable particle picking in cryo-EM micrographs.
Cell Reports Methods, article 101089. Published 2025-06-28. DOI: https://doi.org/10.1016/j.crmeth.2025.101089
Abstract
We present cryoelectron microscopy masked autoencoder (cryo-EMMAE), a self-supervised method designed to overcome the need for manually annotated cryo-EM data. cryo-EMMAE leverages the representation space of a masked autoencoder to pick particle pixels through clustering of the MAE latent representation. Evaluation across different EMPIAR datasets demonstrates that cryo-EMMAE outperforms state-of-the-art supervised methods in terms of generalization capabilities. Importantly, our method showcases consistent performance, independent of the dataset used for training. Additionally, cryo-EMMAE is data efficient, as we experimentally observe that it converges with as few as five micrographs. Further, 3D reconstruction results indicate that our method has superior performance in reconstructing the volumes in both single-particle datasets and multi-particle micrographs derived from cell extracts. Our results underscore the potential of self-supervised learning in advancing cryo-EM image analysis, offering an alternative for more efficient and cost-effective structural biology research. Code is available at https://github.com/azamanos/Cryo-EMMAE.
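The picking step described above, clustering latent representations to separate particle regions from background, can be illustrated with a minimal sketch. Everything specific here is an assumption rather than the authors' implementation: the abstract does not name the clustering algorithm, so plain k-means with two clusters is used as a stand-in; the rule that the minority cluster corresponds to particles is a hypothetical heuristic; and the 2D embeddings are synthetic placeholders for per-patch MAE encoder outputs. The function names `kmeans` and `particle_mask` are illustrative only. For the actual method, refer to the repository linked above.

```python
from math import dist


def kmeans(points, k=2, iters=20):
    """Plain Lloyd's k-means with deterministic farthest-point initialization."""
    centroids = [points[0]]
    while len(centroids) < k:
        # Next centroid: the point farthest from every centroid chosen so far.
        centroids.append(
            max(points, key=lambda p: min(dist(p, c) for c in centroids))
        )
    labels = [0] * len(points)
    for _ in range(iters):
        # Assignment step: each point takes the label of its nearest centroid.
        labels = [
            min(range(k), key=lambda j: dist(p, centroids[j])) for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                centroids[j] = tuple(sum(x) / len(members) for x in zip(*members))
    return labels


def particle_mask(embeddings, grid_h, grid_w):
    """Cluster per-patch embeddings and return a grid_h x grid_w boolean mask."""
    labels = kmeans(embeddings, k=2)
    # Assumption (not from the paper): particles cover the minority of patches.
    minority = min(set(labels), key=labels.count)
    return [
        [labels[r * grid_w + c] == minority for c in range(grid_w)]
        for r in range(grid_h)
    ]


# Synthetic stand-in for MAE latents: a 4x4 patch grid with 2D embeddings.
grid_h, grid_w = 4, 4
embeddings = [(0.1 * (i % 3), 0.1 * (i % 2)) for i in range(grid_h * grid_w)]
for idx in (5, 6):  # two "particle" patches placed far from the background
    embeddings[idx] = (5.0 + 0.1 * idx, 5.0)

mask = particle_mask(embeddings, grid_h, grid_w)
for row in mask:
    print("".join("#" if cell else "." for cell in row))
```

In a real pipeline the embeddings would come from a trained encoder applied to micrograph patches, and the resulting patch-level mask would still need to be converted into particle coordinates; both steps are outside the scope of this sketch.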