George Liang, Sha Sha, Zhao Wang, Huolong Liu, Seongkyu Yoon
{"title":"Soft-sensor model development for CHO growth/production, intracellular metabolite, and glycan predictions.","authors":"George Liang, Sha Sha, Zhao Wang, Huolong Liu, Seongkyu Yoon","doi":"10.3389/fmolb.2024.1441885","DOIUrl":null,"url":null,"abstract":"<p><p>Efficaciously assessing product quality remains time- and resource-intensive. Online Process Analytical Technologies (PATs), encompassing real-time monitoring tools and soft-sensor models, are indispensable for understanding process effects and real-time product quality. This research study evaluated three modeling approaches for predicting CHO cell growth and production, metabolites (extracellular, nucleotide sugar donors (NSD) and glycan profiles): Mechanistic based on first principle Michaelis-Menten kinetics (MMK), data-driven orthogonal partial least square (OPLS) and neural network machine learning (NN). Our experimental design involved galactose-fed batch cultures. MMK excelled in predicting growth and production, demonstrating its reliability in these aspects and reducing the data burden by requiring fewer inputs. However, it was less precise in simulating glycan profiles and intracellular metabolite trends. In contrast, NN and OPLS performed better for predicting precise glycan compositions but displayed shortcomings in accurately predicting growth and production. We utilized time in the training set to address NN and OPLS extrapolation challenges. OPLS and NN models demanded more extensive inputs with similar intracellular metabolite trend prediction. However, there was a significant reduction in time required to develop these two models. The guidance presented here can provide valuable insight into rapid development and application of soft-sensor models with PATs for ipurposes. Therefore, we examined three model typesmproving real-time product CHO therapeutic product quality. Coupled with emerging -omics technologies, NN and OPLS will benefit from massive data availability, and we foresee more robust prediction models that can be advantageous to kinetic or partial-kinetic (hybrid) models.</p>","PeriodicalId":12465,"journal":{"name":"Frontiers in Molecular Biosciences","volume":"11 ","pages":"1441885"},"PeriodicalIF":3.9000,"publicationDate":"2024-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11535473/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Molecular Biosciences","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.3389/fmolb.2024.1441885","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Efficaciously assessing product quality remains time- and resource-intensive. Online Process Analytical Technologies (PATs), encompassing real-time monitoring tools and soft-sensor models, are indispensable for understanding process effects and real-time product quality. This research study evaluated three modeling approaches for predicting CHO cell growth and production, metabolites (extracellular, nucleotide sugar donors (NSD) and glycan profiles): Mechanistic based on first principle Michaelis-Menten kinetics (MMK), data-driven orthogonal partial least square (OPLS) and neural network machine learning (NN). Our experimental design involved galactose-fed batch cultures. MMK excelled in predicting growth and production, demonstrating its reliability in these aspects and reducing the data burden by requiring fewer inputs. However, it was less precise in simulating glycan profiles and intracellular metabolite trends. In contrast, NN and OPLS performed better for predicting precise glycan compositions but displayed shortcomings in accurately predicting growth and production. We utilized time in the training set to address NN and OPLS extrapolation challenges. OPLS and NN models demanded more extensive inputs with similar intracellular metabolite trend prediction. However, there was a significant reduction in time required to develop these two models. The guidance presented here can provide valuable insight into rapid development and application of soft-sensor models with PATs for ipurposes. Therefore, we examined three model typesmproving real-time product CHO therapeutic product quality. Coupled with emerging -omics technologies, NN and OPLS will benefit from massive data availability, and we foresee more robust prediction models that can be advantageous to kinetic or partial-kinetic (hybrid) models.
有效评估产品质量仍然需要大量的时间和资源。在线过程分析技术(PAT)包括实时监测工具和软传感器模型,是了解过程效应和实时产品质量所不可或缺的。这项研究评估了三种预测 CHO 细胞生长和生产、代谢物(细胞外、核苷酸供糖体 (NSD) 和糖概况)的建模方法:基于第一原理的迈克尔-门顿动力学(MMK)、数据驱动的正交偏最小二乘法(OPLS)和神经网络机器学习(NN)。我们的实验设计涉及半乳糖喂养的批量培养。MMK 在预测生长和产量方面表现出色,证明了其在这些方面的可靠性,并通过减少输入量减轻了数据负担。然而,它在模拟糖谱和细胞内代谢物趋势方面不够精确。相比之下,NN 和 OPLS 在预测精确的糖组成方面表现较好,但在准确预测生长和产量方面存在不足。我们利用训练集中的时间来解决 NN 和 OPLS 的外推难题。OPLS 和 NN 模型需要更广泛的输入,但对细胞内代谢物趋势的预测却相似。不过,开发这两种模型所需的时间大大减少。这里介绍的指南可以为快速开发和应用具有 PAT 的软传感器模型提供有价值的见解。因此,我们研究了证明实时产品 CHO 治疗产品质量的三种模型类型。与新兴的组学技术相结合,NN 和 OPLS 将从海量数据的可用性中获益,我们预见到更强大的预测模型将比动力学或部分动力学(混合)模型更具优势。
期刊介绍:
Much of contemporary investigation in the life sciences is devoted to the molecular-scale understanding of the relationships between genes and the environment — in particular, dynamic alterations in the levels, modifications, and interactions of cellular effectors, including proteins. Frontiers in Molecular Biosciences offers an international publication platform for basic as well as applied research; we encourage contributions spanning both established and emerging areas of biology. To this end, the journal draws from empirical disciplines such as structural biology, enzymology, biochemistry, and biophysics, capitalizing as well on the technological advancements that have enabled metabolomics and proteomics measurements in massively parallel throughput, and the development of robust and innovative computational biology strategies. We also recognize influences from medicine and technology, welcoming studies in molecular genetics, molecular diagnostics and therapeutics, and nanotechnology.
Our ultimate objective is the comprehensive illustration of the molecular mechanisms regulating proteins, nucleic acids, carbohydrates, lipids, and small metabolites in organisms across all branches of life.
In addition to interesting new findings, techniques, and applications, Frontiers in Molecular Biosciences will consider new testable hypotheses to inspire different perspectives and stimulate scientific dialogue. The integration of in silico, in vitro, and in vivo approaches will benefit endeavors across all domains of the life sciences.