Selina Ramm, T. H. Rodríguez, Björn Frahm, M. Pein-Hackelbusch
{"title":"介质光谱数据的系统预处理与活细胞密度估算","authors":"Selina Ramm, T. H. Rodríguez, Björn Frahm, M. Pein-Hackelbusch","doi":"10.1109/INDIN51400.2023.10218012","DOIUrl":null,"url":null,"abstract":"For process monitoring, an adequate data preprocessing is crucial to link accessible inline process data with offline measured target variables. Literature, however, does not provide systematic preprocessing strategies. The effects of five different preprocessing strategies on data from a Dielectric Spectroscopy system applied to the Viable Cell Density (VCD) of a mammalian cell cultivation were thus evaluated. Single-frequency measurements are typically used to model the VCD over the growth phase using linear regression or the Cole-Cole model and served as a reference. As multi-frequency measurement is promising to model the VCD beyond the growth phase using Partial Least Squares Regression (PLSR), we further aimed to determine, whether replacing linear regression by PLSR shows comparable modeling performance. All five preprocessing strategies led to comparable results. Exemplary, when using capacitance values at a frequency of 3347 kHz, linear regression resulted in a $\\mathrm{R}^{2}$ of 0.90 and a standard deviation of 0.4 % on average. Both normalization techniques had the same positive effect on the results of PLSR. The order of smoothing and normalization was irrelevant for both regression methods. Comparing the results of linear regression and PLSR, the latter obtained on average 9 % better results. Therefore, we concluded that PLSR is preferable over linear regression and is potentially suitable to model the VCD beyond the growth phase, which is suggested to be investigated based on more data sets.","PeriodicalId":174443,"journal":{"name":"2023 IEEE 21st International Conference on Industrial Informatics (INDIN)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Systematic Preprocessing of Dielectric Spectroscopy Data and Estimating Viable Cell Densities\",\"authors\":\"Selina Ramm, T. H. Rodríguez, Björn Frahm, M. Pein-Hackelbusch\",\"doi\":\"10.1109/INDIN51400.2023.10218012\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"For process monitoring, an adequate data preprocessing is crucial to link accessible inline process data with offline measured target variables. Literature, however, does not provide systematic preprocessing strategies. The effects of five different preprocessing strategies on data from a Dielectric Spectroscopy system applied to the Viable Cell Density (VCD) of a mammalian cell cultivation were thus evaluated. Single-frequency measurements are typically used to model the VCD over the growth phase using linear regression or the Cole-Cole model and served as a reference. As multi-frequency measurement is promising to model the VCD beyond the growth phase using Partial Least Squares Regression (PLSR), we further aimed to determine, whether replacing linear regression by PLSR shows comparable modeling performance. All five preprocessing strategies led to comparable results. Exemplary, when using capacitance values at a frequency of 3347 kHz, linear regression resulted in a $\\\\mathrm{R}^{2}$ of 0.90 and a standard deviation of 0.4 % on average. Both normalization techniques had the same positive effect on the results of PLSR. The order of smoothing and normalization was irrelevant for both regression methods. Comparing the results of linear regression and PLSR, the latter obtained on average 9 % better results. Therefore, we concluded that PLSR is preferable over linear regression and is potentially suitable to model the VCD beyond the growth phase, which is suggested to be investigated based on more data sets.\",\"PeriodicalId\":174443,\"journal\":{\"name\":\"2023 IEEE 21st International Conference on Industrial Informatics (INDIN)\",\"volume\":\"3 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-07-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE 21st International Conference on Industrial Informatics (INDIN)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/INDIN51400.2023.10218012\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE 21st International Conference on Industrial Informatics (INDIN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INDIN51400.2023.10218012","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Systematic Preprocessing of Dielectric Spectroscopy Data and Estimating Viable Cell Densities
For process monitoring, an adequate data preprocessing is crucial to link accessible inline process data with offline measured target variables. Literature, however, does not provide systematic preprocessing strategies. The effects of five different preprocessing strategies on data from a Dielectric Spectroscopy system applied to the Viable Cell Density (VCD) of a mammalian cell cultivation were thus evaluated. Single-frequency measurements are typically used to model the VCD over the growth phase using linear regression or the Cole-Cole model and served as a reference. As multi-frequency measurement is promising to model the VCD beyond the growth phase using Partial Least Squares Regression (PLSR), we further aimed to determine, whether replacing linear regression by PLSR shows comparable modeling performance. All five preprocessing strategies led to comparable results. Exemplary, when using capacitance values at a frequency of 3347 kHz, linear regression resulted in a $\mathrm{R}^{2}$ of 0.90 and a standard deviation of 0.4 % on average. Both normalization techniques had the same positive effect on the results of PLSR. The order of smoothing and normalization was irrelevant for both regression methods. Comparing the results of linear regression and PLSR, the latter obtained on average 9 % better results. Therefore, we concluded that PLSR is preferable over linear regression and is potentially suitable to model the VCD beyond the growth phase, which is suggested to be investigated based on more data sets.