Hanneke Leegwater, Zhengzheng Zhang, Xiaobing Zhang, Thomas Hankemeier, Amy C. Harms, Annelien J. M. Zweemer, Sylvia E. Le Dévédec, Alida Kindt
{"title":"Normalization Strategies for Lipidome Data in Cell Line Panels","authors":"Hanneke Leegwater, Zhengzheng Zhang, Xiaobing Zhang, Thomas Hankemeier, Amy C. Harms, Annelien J. M. Zweemer, Sylvia E. Le Dévédec, Alida Kindt","doi":"10.1002/cem.3636","DOIUrl":null,"url":null,"abstract":"<p>Sample collection can significantly affect lipid concentration measurements in cell line panels, concealing intrinsic differences between cancer subtypes. Most quality control steps in lipidomic data analysis focus on controlling technical variation. Correcting for the total amount of biological material remains an additional challenge for cell line panels. Here, we investigated how we can normalize lipidomic data acquired from multiple cell lines to correct for differences in sample biomass. We studied how commonly used data normalization and transformation strategies influence the resulting lipid data distributions. We compared normalization by biological properties including cell count and total protein concentration, to statistical and data-based approaches, such as median, mean, or probabilistic quotient-based normalization. We used intraclass correlations to estimate how normalization influenced the similarity between replicates. Normalizing lipidomic data by cell count improved the similarity between replicates but only for cell lines with similar morphologies. When comparing cell line panels with diverse morphologies neither cell count nor protein concentration was sufficient to increase the similarity of lipid abundances between cell line replicates. Data-based normalizations increased these similarities but resulted in a bias towards the large and variable lipid class of triglycerides. These artifacts are reduced by normalizing for the abundance of only structural lipids. We conclude that there is a delicate balance between improving the similarity between replicates and avoiding artifacts in lipidomic data and emphasize the importance of an appropriate normalization strategy in studying biological phenomena using lipidomics.</p>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 1","pages":""},"PeriodicalIF":2.3000,"publicationDate":"2024-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cem.3636","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemometrics","FirstCategoryId":"92","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cem.3636","RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SOCIAL WORK","Score":null,"Total":0}
引用次数: 0
Abstract
Sample collection can significantly affect lipid concentration measurements in cell line panels, concealing intrinsic differences between cancer subtypes. Most quality control steps in lipidomic data analysis focus on controlling technical variation. Correcting for the total amount of biological material remains an additional challenge for cell line panels. Here, we investigated how we can normalize lipidomic data acquired from multiple cell lines to correct for differences in sample biomass. We studied how commonly used data normalization and transformation strategies influence the resulting lipid data distributions. We compared normalization by biological properties including cell count and total protein concentration, to statistical and data-based approaches, such as median, mean, or probabilistic quotient-based normalization. We used intraclass correlations to estimate how normalization influenced the similarity between replicates. Normalizing lipidomic data by cell count improved the similarity between replicates but only for cell lines with similar morphologies. When comparing cell line panels with diverse morphologies neither cell count nor protein concentration was sufficient to increase the similarity of lipid abundances between cell line replicates. Data-based normalizations increased these similarities but resulted in a bias towards the large and variable lipid class of triglycerides. These artifacts are reduced by normalizing for the abundance of only structural lipids. We conclude that there is a delicate balance between improving the similarity between replicates and avoiding artifacts in lipidomic data and emphasize the importance of an appropriate normalization strategy in studying biological phenomena using lipidomics.
期刊介绍:
The Journal of Chemometrics is devoted to the rapid publication of original scientific papers, reviews and short communications on fundamental and applied aspects of chemometrics. It also provides a forum for the exchange of information on meetings and other news relevant to the growing community of scientists who are interested in chemometrics and its applications. Short, critical review papers are a particularly important feature of the journal, in view of the multidisciplinary readership at which it is aimed.