O Lauzirika,M Pernica,D Herreros,E Ramírez-Aportela,J Krieger,M Gragera,M Iceta,P Conesa,Y Fonseca,J Jiménez,J Filipovic,J M Carazo,C O S Sorzano
{"title":"在单粒子分析中,我们可以识别出多少(可区分的)类别?","authors":"O Lauzirika,M Pernica,D Herreros,E Ramírez-Aportela,J Krieger,M Gragera,M Iceta,P Conesa,Y Fonseca,J Jiménez,J Filipovic,J M Carazo,C O S Sorzano","doi":"10.1107/s2059798325007831","DOIUrl":null,"url":null,"abstract":"Heterogeneity in cryoEM is essential for capturing the structural variability of macromolecules, reflecting their functional states and biological significance. However, estimating heterogeneity remains challenging due to particle misclassification and algorithmic biases, which can lead to reconstructions that blend distinct conformations or fail to resolve subtle differences. Furthermore, the low signal-to-noise ratio inherent in cryo-EM data makes it nearly impossible to detect minute structural changes, as noise often obscures subtle variations in macromolecular projections. In this paper, we investigate the use of p-values associated with the null hypothesis that the observed classification differs from a random partition of the input data set, thereby providing a statistical framework for determining the number of distinguishable classes present in a given data set.","PeriodicalId":501686,"journal":{"name":"Acta Crystallographica Section D","volume":"37 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"How many (distinguishable) classes can we identify in single-particle analysis?\",\"authors\":\"O Lauzirika,M Pernica,D Herreros,E Ramírez-Aportela,J Krieger,M Gragera,M Iceta,P Conesa,Y Fonseca,J Jiménez,J Filipovic,J M Carazo,C O S Sorzano\",\"doi\":\"10.1107/s2059798325007831\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Heterogeneity in cryoEM is essential for capturing the structural variability of macromolecules, reflecting their functional states and biological significance. However, estimating heterogeneity remains challenging due to particle misclassification and algorithmic biases, which can lead to reconstructions that blend distinct conformations or fail to resolve subtle differences. Furthermore, the low signal-to-noise ratio inherent in cryo-EM data makes it nearly impossible to detect minute structural changes, as noise often obscures subtle variations in macromolecular projections. In this paper, we investigate the use of p-values associated with the null hypothesis that the observed classification differs from a random partition of the input data set, thereby providing a statistical framework for determining the number of distinguishable classes present in a given data set.\",\"PeriodicalId\":501686,\"journal\":{\"name\":\"Acta Crystallographica Section D\",\"volume\":\"37 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Acta Crystallographica Section D\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1107/s2059798325007831\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Acta Crystallographica Section D","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1107/s2059798325007831","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
How many (distinguishable) classes can we identify in single-particle analysis?
Heterogeneity in cryoEM is essential for capturing the structural variability of macromolecules, reflecting their functional states and biological significance. However, estimating heterogeneity remains challenging due to particle misclassification and algorithmic biases, which can lead to reconstructions that blend distinct conformations or fail to resolve subtle differences. Furthermore, the low signal-to-noise ratio inherent in cryo-EM data makes it nearly impossible to detect minute structural changes, as noise often obscures subtle variations in macromolecular projections. In this paper, we investigate the use of p-values associated with the null hypothesis that the observed classification differs from a random partition of the input data set, thereby providing a statistical framework for determining the number of distinguishable classes present in a given data set.