Christelle Kemda Ngueda, Julia Palm, Flavia Remo, André Scherag, Lutz Leistritz
{"title":"单样本缺失dna甲基化值估算。","authors":"Christelle Kemda Ngueda, Julia Palm, Flavia Remo, André Scherag, Lutz Leistritz","doi":"10.1186/s12859-025-06154-9","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Currently, the most popular methods for missing DNA-methylation value imputation rely on exploiting methylation patterns across multiple samples from the same population. However, if there is significant variability between individuals or limited data available, these methods might produce biased results. This situation has prompted researchers to seek alternative approaches for handling single-sample data, particularly in the context of personalized medicine. Accordingly, we propose One-Sample Methyl Imputation (OSMI), an imputation method that can also be used in single-sample applications.</p><p><strong>Results: </strong>The proposed method in single-subject cases yielded an average imputation accuracy of RMSE = 0.2713 (95%-CI from 0.2696 to 0.2730) in β-value units (range: 0-1) based on real 450 K BeadChip data sets of 3,402 individuals. It is possible to take the affiliation of individual CpGs to CpG islands into account during the imputation of missing methylation values. This improves the imputation accuracy. In addition, the accuracy of imputation depends in general on the density of CpG sites on DNA-methylation microarrays and increases as the CpG site density increases. OSMI has low memory and computational requirements.</p><p><strong>Conclusions: </strong>OSMI uses a single methylome to impute missing values quickly at very low memory constraints. Its imputation accuracy is inferior to other methods if multiple samples are available and these samples are reasonably similar, but OSMI represents a useful addition to the imputation toolbox for the case of single-sample applications.</p>","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":"26 1","pages":"143"},"PeriodicalIF":3.3000,"publicationDate":"2025-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12126866/pdf/","citationCount":"0","resultStr":"{\"title\":\"One-sample missing DNA-methylation value imputation.\",\"authors\":\"Christelle Kemda Ngueda, Julia Palm, Flavia Remo, André Scherag, Lutz Leistritz\",\"doi\":\"10.1186/s12859-025-06154-9\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Currently, the most popular methods for missing DNA-methylation value imputation rely on exploiting methylation patterns across multiple samples from the same population. However, if there is significant variability between individuals or limited data available, these methods might produce biased results. This situation has prompted researchers to seek alternative approaches for handling single-sample data, particularly in the context of personalized medicine. Accordingly, we propose One-Sample Methyl Imputation (OSMI), an imputation method that can also be used in single-sample applications.</p><p><strong>Results: </strong>The proposed method in single-subject cases yielded an average imputation accuracy of RMSE = 0.2713 (95%-CI from 0.2696 to 0.2730) in β-value units (range: 0-1) based on real 450 K BeadChip data sets of 3,402 individuals. It is possible to take the affiliation of individual CpGs to CpG islands into account during the imputation of missing methylation values. This improves the imputation accuracy. In addition, the accuracy of imputation depends in general on the density of CpG sites on DNA-methylation microarrays and increases as the CpG site density increases. OSMI has low memory and computational requirements.</p><p><strong>Conclusions: </strong>OSMI uses a single methylome to impute missing values quickly at very low memory constraints. Its imputation accuracy is inferior to other methods if multiple samples are available and these samples are reasonably similar, but OSMI represents a useful addition to the imputation toolbox for the case of single-sample applications.</p>\",\"PeriodicalId\":8958,\"journal\":{\"name\":\"BMC Bioinformatics\",\"volume\":\"26 1\",\"pages\":\"143\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2025-05-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12126866/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Bioinformatics\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1186/s12859-025-06154-9\",\"RegionNum\":3,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"BIOCHEMICAL RESEARCH METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s12859-025-06154-9","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
摘要
背景:目前,最流行的缺失dna甲基化值估算方法依赖于利用来自同一人群的多个样本的甲基化模式。然而,如果个体之间存在显著差异或可用数据有限,这些方法可能会产生有偏差的结果。这种情况促使研究人员寻求处理单样本数据的替代方法,特别是在个性化医疗的背景下。因此,我们提出了一种可用于单样本应用的单样本甲基归算方法(OSMI)。结果:在单受试者情况下,基于3,402个人的450 K BeadChip真实数据集,以β值单位(范围:0-1)为单位,所提出的方法的平均输入精度RMSE = 0.2713 (95% ci为0.2696至0.2730)。在缺失甲基化值的估算过程中,可以考虑单个CpG与CpG岛的关系。这提高了插补精度。此外,插入的准确性通常取决于dna甲基化微阵列上CpG位点的密度,并随着CpG位点密度的增加而增加。OSMI具有较低的内存和计算需求。结论:OSMI使用单个甲基组在非常低的记忆限制下快速估算缺失值。如果有多个样本可用,并且这些样本相当相似,那么OSMI的输入精度不如其他方法,但是对于单样本应用程序,OSMI是对输入工具箱的有用补充。
One-sample missing DNA-methylation value imputation.
Background: Currently, the most popular methods for missing DNA-methylation value imputation rely on exploiting methylation patterns across multiple samples from the same population. However, if there is significant variability between individuals or limited data available, these methods might produce biased results. This situation has prompted researchers to seek alternative approaches for handling single-sample data, particularly in the context of personalized medicine. Accordingly, we propose One-Sample Methyl Imputation (OSMI), an imputation method that can also be used in single-sample applications.
Results: The proposed method in single-subject cases yielded an average imputation accuracy of RMSE = 0.2713 (95%-CI from 0.2696 to 0.2730) in β-value units (range: 0-1) based on real 450 K BeadChip data sets of 3,402 individuals. It is possible to take the affiliation of individual CpGs to CpG islands into account during the imputation of missing methylation values. This improves the imputation accuracy. In addition, the accuracy of imputation depends in general on the density of CpG sites on DNA-methylation microarrays and increases as the CpG site density increases. OSMI has low memory and computational requirements.
Conclusions: OSMI uses a single methylome to impute missing values quickly at very low memory constraints. Its imputation accuracy is inferior to other methods if multiple samples are available and these samples are reasonably similar, but OSMI represents a useful addition to the imputation toolbox for the case of single-sample applications.
期刊介绍:
BMC Bioinformatics is an open access, peer-reviewed journal that considers articles on all aspects of the development, testing and novel application of computational and statistical methods for the modeling and analysis of all kinds of biological data, as well as other areas of computational biology.
BMC Bioinformatics is part of the BMC series which publishes subject-specific journals focused on the needs of individual research communities across all areas of biology and medicine. We offer an efficient, fair and friendly peer review service, and are committed to publishing all sound science, provided that there is some advance in knowledge presented by the work.