Journal of Chemometrics最新文献

筛选
英文 中文
Determination of Halitosis by Exhaled Breath Analysis Using Semiconductor Metal Oxide Sensors and Chemometric Methods
IF 2.3 4区 化学
Journal of Chemometrics Pub Date : 2025-02-17 DOI: 10.1002/cem.70012
Mikhail Saveliev, Andrey Volchek, Galina Lavrenova, Ol'ga Malay, Mikhail Grevtsev, Igor Jahatspanian
{"title":"Determination of Halitosis by Exhaled Breath Analysis Using Semiconductor Metal Oxide Sensors and Chemometric Methods","authors":"Mikhail Saveliev,&nbsp;Andrey Volchek,&nbsp;Galina Lavrenova,&nbsp;Ol'ga Malay,&nbsp;Mikhail Grevtsev,&nbsp;Igor Jahatspanian","doi":"10.1002/cem.70012","DOIUrl":"https://doi.org/10.1002/cem.70012","url":null,"abstract":"<div>\u0000 \u0000 <p>Halitosis is a condition associated with bad breath. Although halitosis is a disease in its own right, it is often a symptom of more serious diseases (diabetes mellitus, renal failure, azotemia, etc.). The currently used method for diagnosing halitosis is the organoleptic method, which relies on a trained specialist evaluating the patient's breath odor. This approach to diagnosing halitosis is subjective, uncomfortable for both patient and doctor, and necessitates the involvement of a specially trained professional. As an alternative, instrumental diagnostics employing metal oxide semiconductor (MOS) sensor arrays offer a promising avenue by enabling patient classification through predeveloped models. This paper considers the application of seven MOS sensors of different compositions at three different temperatures. Different methods of chemometric data analysis were applied: <i>k</i>-nearest neighbors (kNN), decision trees (DT), support vector machine (SVM), logistic regression (LR), and projection on latent structures discrimination analysis (PLSDA). All applied methods demonstrated their effectiveness and achieved selectivity, sensitivity, and accuracy values exceeding 85%. Additionally, a combined classifier leveraging responses from all previously studied classifiers was explored, achieving near-perfect classification accuracy.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 2","pages":""},"PeriodicalIF":2.3,"publicationDate":"2025-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143431714","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Multiple Linear Regression–Based Algorithm to Correct for Cosmic Rays in Raman Images
IF 2.3 4区 化学
Journal of Chemometrics Pub Date : 2025-02-17 DOI: 10.1002/cem.70000
Hery Mitsutake, Eneida de Paula, Heloisa N. Bordallo, Douglas N. Rutledge
{"title":"A Multiple Linear Regression–Based Algorithm to Correct for Cosmic Rays in Raman Images","authors":"Hery Mitsutake,&nbsp;Eneida de Paula,&nbsp;Heloisa N. Bordallo,&nbsp;Douglas N. Rutledge","doi":"10.1002/cem.70000","DOIUrl":"https://doi.org/10.1002/cem.70000","url":null,"abstract":"<p>Raman imaging is a powerful technique for simultaneously obtaining chemical and spatial information on diverse materials. One of the most common detectors used on Raman equipment is the charge coupled detector (CCD) due its high sensitivity. However, CCDs are also sensitive to cosmic rays, that generate very narrow and intense signals: cosmic ray spikes. Since these peaks can be very intense and numerous, it is important to eliminate them before any data analysis. Some methods to do this use comparison of neighboring pixels to identify spikes, but when using the line-scanning acquisition mode, it is common that these spikes appear in two or more pixels close together. Thus, in this work, a new algorithm has been developed to correct for cosmic ray spikes in Raman images, based on multiple linear regression (MLR). This algorithm takes less than 1 min in images with more than 70,000 spectra and removes all spikes, even those at low intensity.</p>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 2","pages":""},"PeriodicalIF":2.3,"publicationDate":"2025-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cem.70000","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143431713","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multimodal Stacked Modeling for Simultaneous Detection of Nutrient Concentrations With Turbidity Correction
IF 2.3 4区 化学
Journal of Chemometrics Pub Date : 2025-02-17 DOI: 10.1002/cem.70009
Meryem Nini, Mohamed Nohair
{"title":"Multimodal Stacked Modeling for Simultaneous Detection of Nutrient Concentrations With Turbidity Correction","authors":"Meryem Nini,&nbsp;Mohamed Nohair","doi":"10.1002/cem.70009","DOIUrl":"https://doi.org/10.1002/cem.70009","url":null,"abstract":"<div>\u0000 \u0000 <p>In this paper, an innovative method for the simultaneous determination of nitrite, nitrate, and COD in water in the presence of turbidity as a source of noise in spectroscopic data has been investigated. UV–Vis absorption spectrometry and advanced machine learning are proposed to develop a stacking model, a sophisticated modeling approach that combines several basic models (PLS, Lasso, and Ridge regression) and a meta-regressor (Random Forest regressor) to improve prediction accuracy by incorporating baseline correction and principal component analysis (PCA) to mitigate the effects of turbidity on spectroscopic data. After applying these corrections, a significant improvement was observed: The root mean square error (RMSE) and the mean absolute error (MAE) were significantly reduced, and the correlation coefficient (<i>R</i><sup>2</sup>) between predicted and actual values of nitrite, nitrate, COD, and turbidity was greater than 0.96, for all compounds in the test data set, that demonstrate the ability of the proposed stacking model to accurately predict nutrient concentrations simultaneously, even in complex environments; the proposed model may provide a valuable alternative to wet chemical methods. Due to its high accuracy and fast response, the proposed model can be used as an algorithm for the construction of nutrient sensors. This paper highlights the importance of integrating advanced modeling and data correction techniques to improve the robustness and accuracy of predictive models in environmental chemistry, thus providing valuable information for environmental monitoring and management.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 3","pages":""},"PeriodicalIF":2.3,"publicationDate":"2025-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143431375","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Progress of Complex System Process Analysis Based on Modern Spectroscopy Combined With Chemometrics
IF 2.3 4区 化学
Journal of Chemometrics Pub Date : 2025-02-11 DOI: 10.1002/cem.70006
Maogang Li, Qi Cai, Tianlong Zhang, Hongsheng Tang, Hua Li
{"title":"Progress of Complex System Process Analysis Based on Modern Spectroscopy Combined With Chemometrics","authors":"Maogang Li,&nbsp;Qi Cai,&nbsp;Tianlong Zhang,&nbsp;Hongsheng Tang,&nbsp;Hua Li","doi":"10.1002/cem.70006","DOIUrl":"https://doi.org/10.1002/cem.70006","url":null,"abstract":"<div>\u0000 \u0000 <p>In recent years, the role of analytical chemistry has undergone a gradual transformation, evolving from a mere participant to a pivotal decision-maker in process optimisation. This shift can be attributed to the advent of sophisticated analytical instrumentation, which has ushered in a new era of analytical capabilities. This article presents a review of the developments in the application of intelligent analysis techniques, including infrared (IR) spectroscopy, Raman spectroscopy, and laser-induced breakdown spectroscopy (LIBS), in the processing of complex systems over the past decade. The review provides an introduction to the fundamental principles of these analytical techniques and examines the evolution of their instrumentation to accommodate online process monitoring. The analysis of spectral data in complex system processes represents a fundamental aspect of the attainment of on-site quality monitoring, process optimisation and control. Accordingly, the review provides a comprehensive overview of the methodologies employed in process chemometrics, encompassing spectral preprocessing, feature selection, modelling techniques, and optimisation strategies for model performance. Furthermore, this article presents a summary of three intelligent spectral analysis tools, namely infrared spectroscopy, Raman spectroscopy, and LIBS, which are widely employed in process simulation, monitoring, optimisation, and control across multiple disciplines, including the environment, energy, biology, and food. The objective of this review is to provide a valuable reference point and guidance for the further promotion and utilisation of spectral intelligent analysis instruments, with the aim of promoting their in-depth application and development in a greater number of fields.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 2","pages":""},"PeriodicalIF":2.3,"publicationDate":"2025-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143380521","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparison of Chemometric Explorative Multi-Omics Data Analysis Methods Applied to a Mechanistic Pan-Cancer Cell Model
IF 2.3 4区 化学
Journal of Chemometrics Pub Date : 2025-02-11 DOI: 10.1002/cem.70001
J. A. Westerhuis, A. Heintz-Buschart, H. C. J. Hoefsloot, F. M. van der Kloet, G. R. van der Ploeg, F. T. G. White
{"title":"Comparison of Chemometric Explorative Multi-Omics Data Analysis Methods Applied to a Mechanistic Pan-Cancer Cell Model","authors":"J. A. Westerhuis,&nbsp;A. Heintz-Buschart,&nbsp;H. C. J. Hoefsloot,&nbsp;F. M. van der Kloet,&nbsp;G. R. van der Ploeg,&nbsp;F. T. G. White","doi":"10.1002/cem.70001","DOIUrl":"https://doi.org/10.1002/cem.70001","url":null,"abstract":"<p>The analysis of single cell multi-omics data is a complex task, and many explorative data analysis methods are being used to draw information from such data. This paper compares several of these methods to visualize the output of a mechanistic model under various simulated conditions. The analysis methods include PCA, PARAFAC, ASCA, MASCARA, COVSCA, P-ESCA, and PE-ASCA. These techniques, applied to high-dimensional data such as gene expression and protein levels, assess correlations across time series and experimental conditions. The study uses a complex mechanistic model of MCF10A cancer cells, simulating interactions between signaling pathways related to cell growth and division. Results show that while methods like PCA PARAFAC and ASCA reveal time-dependent variations in protein data, mRNA data exhibit minimal systematic variation. MASCARA offers unique insights by identifying genes linked to specific pathways. This work highlights the potential and limitations of various data analysis methods in understanding multi-omics data, particularly in single-cell contexts where experimental variation and stochastic processes complicate interpretation.</p>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 2","pages":""},"PeriodicalIF":2.3,"publicationDate":"2025-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cem.70001","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143389009","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving Vapor Pressure Prediction Through Integration of Multiple Molecular Representations: A Super Learner Approach
IF 2.3 4区 化学
Journal of Chemometrics Pub Date : 2025-02-10 DOI: 10.1002/cem.70003
Ji Hyun Nam, Seul Lee, Seongil Jo, Jaeoh Kim, Jooyeon Lee, Jahyun Koo, Byounghwak Lee, Keunhong Jeong, Donghyeon Yu
{"title":"Improving Vapor Pressure Prediction Through Integration of Multiple Molecular Representations: A Super Learner Approach","authors":"Ji Hyun Nam,&nbsp;Seul Lee,&nbsp;Seongil Jo,&nbsp;Jaeoh Kim,&nbsp;Jooyeon Lee,&nbsp;Jahyun Koo,&nbsp;Byounghwak Lee,&nbsp;Keunhong Jeong,&nbsp;Donghyeon Yu","doi":"10.1002/cem.70003","DOIUrl":"https://doi.org/10.1002/cem.70003","url":null,"abstract":"<div>\u0000 \u0000 <p>Accurate prediction of vapor pressure is essential in chemical engineering, environmental science, and pharmaceutical development, impacting the volatility and stability of compounds. Traditional methods often fall short for complex and new molecular structures. This study introduces an advanced machine learning approach, integrating graph neural networks (GNNs), and CHEM-BERT models to improve prediction accuracy. Utilizing the largest dataset to date, we derived comprehensive chemical descriptors and fingerprints. We evaluated 19 predictive models, including ridge regression, random forest, support vector regression, and feed-forward neural networks, trained on diverse features like PaDEL and Morgan fingerprints, chemical descriptors, and Chem-BERT embeddings. Central to our methodology is the super learner architecture, which combines 19 multiple models to enhance accuracy. The super learner achieved a root mean squared error (RMSE) of 0.8200, outperforming individual models and previous reports. These successful results highlight the effectiveness of integrating GNNs and Chem-BERT for capturing detailed molecular information, setting a new benchmark for vapor pressure prediction. This study underscores the value of advanced machine learning techniques and comprehensive datasets, offering a robust tool for researchers and paving the way for future advancements in chemical property prediction.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 2","pages":""},"PeriodicalIF":2.3,"publicationDate":"2025-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143380352","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Nonparametric Threshold Estimation of Autocorrelated Statistics in Multivariate Statistical Process Monitoring
IF 2.3 4区 化学
Journal of Chemometrics Pub Date : 2025-02-05 DOI: 10.1002/cem.70004
Taylor R. Grimm, Kathryn B. Newhart, Amanda S. Hering
{"title":"Nonparametric Threshold Estimation of Autocorrelated Statistics in Multivariate Statistical Process Monitoring","authors":"Taylor R. Grimm,&nbsp;Kathryn B. Newhart,&nbsp;Amanda S. Hering","doi":"10.1002/cem.70004","DOIUrl":"https://doi.org/10.1002/cem.70004","url":null,"abstract":"<div>\u0000 \u0000 <p>Multivariate statistical process monitoring is commonly used to detect abnormal process behavior in real time. Multiple process variables are monitored simultaneously, and alarms are issued when monitoring statistics exceed a predetermined threshold. Traditional approaches use a parametric threshold based on the assumptions of independence and multivariate normality of the process data, which are often violated in complex processes with high sampling frequencies, leading to excessive false alarms. Some approaches for improved threshold selection have been proposed, but they assume independence of the monitoring statistics, which are often autocorrelated. In this paper, we compare the performance of nonparametric estimators for computing thresholds from autocorrelated monitoring statistics through simulation. The false alarm rate and in-control average run length of each estimator under different distributions, sample sizes, and autocorrelation levels and types are found. Estimator performance is found to depend on sample size and the strength of autocorrelation. The class of kernel density estimation (KDE) methods tends to perform better than estimators that use bootstrapping, and the proposed adjusted KDE methods that account for autocorrelation are recommended for general use. A case study to monitor a wastewater treatment facility further illustrates the performance of nonparametric and parametric thresholds when applied to real-world systems.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 2","pages":""},"PeriodicalIF":2.3,"publicationDate":"2025-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143248476","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cell Culture Media and Raman Spectra Preprocessing Procedures Impact Glucose Chemometrics
IF 2.3 4区 化学
Journal of Chemometrics Pub Date : 2025-02-05 DOI: 10.1002/cem.70005
Naresh Pavurala, Chikkathur N. Madhavarao, Jaeweon Lee, Jayanti Das, Muhammad Ashraf, Thomas O'Connor
{"title":"Cell Culture Media and Raman Spectra Preprocessing Procedures Impact Glucose Chemometrics","authors":"Naresh Pavurala,&nbsp;Chikkathur N. Madhavarao,&nbsp;Jaeweon Lee,&nbsp;Jayanti Das,&nbsp;Muhammad Ashraf,&nbsp;Thomas O'Connor","doi":"10.1002/cem.70005","DOIUrl":"https://doi.org/10.1002/cem.70005","url":null,"abstract":"<div>\u0000 \u0000 <p>Deployment of process analytical technology tools such as Raman or IR spectroscopy and associated multivariate calibration models for process monitoring and control plays an important role in process automation and advanced manufacturing of pharmaceuticals. Preprocessing or preparation of the spectroscopic data is an important step in developing a multivariate calibration model. There are several ways available to preprocess the data and each may influence the calibration model performance differently. Here we investigated the influence of preprocessing procedures on the development and performance of the chemometric models to predict the glucose concentration in a bioreactor. Box–Behnken design of experiment (DOE) was used to generate the Raman spectroscopy data. Four factors were considered critical in the DOE—glucose, glutamine, glutamic acid, and antifoam concentration. Raman spectroscopy data were collected both with and without aeration conditions, independently from three cell culture media. For each medium, data consisted of calibration set (27 conditions) and model validation set (9 conditions) separately. Additionally, Raman data was also collected for certain DOE runs with increasing concentration of cell densities ranging from 0.5 × 10 E06/mL to 30 × 10 E06/mL under aerating conditions. Data from the three cell culture media were used separately to develop calibration models that used four different preprocessing procedures, namely, baseline correction (BLC), Savitzky–Golay smoothing (SGS), Savitzky–Golay derivative (SGD) and orthogonal signal correction (OSC). The preprocessing procedures were applied individually and in combinations to evaluate the calibration model parameters and the performance metrics. We further developed glucose calibration models based on partial least squares (PLS) regression with 1–3 principal components. The models developed with OSC procedure gave superior performance metrics with just one principal component across all three media. Models developed with other preprocessing procedures required two or more principal components to give comparable performance. Overall, the choice of preprocessing procedures affected the model performance.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 2","pages":""},"PeriodicalIF":2.3,"publicationDate":"2025-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143248478","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Alignment-Agnostic Methodology for the Analysis of Designed Separations Data
IF 2.3 4区 化学
Journal of Chemometrics Pub Date : 2025-01-25 DOI: 10.1002/cem.70002
Michael Sorochan Armstrong, José Camacho
{"title":"An Alignment-Agnostic Methodology for the Analysis of Designed Separations Data","authors":"Michael Sorochan Armstrong,&nbsp;José Camacho","doi":"10.1002/cem.70002","DOIUrl":"https://doi.org/10.1002/cem.70002","url":null,"abstract":"<div>\u0000 \u0000 <p>Chemical separations data are typically analyzed in the time domain using methods that integrate the discrete elution bands. Integrating the same chemical components across several samples must account for retention time drift over the course of an entire experiment as the physical characteristics of the separation are altered through several cycles of use. Failure to consistently integrate the components within a matrix of <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>M</mi>\u0000 <mo>×</mo>\u0000 <mi>N</mi>\u0000 </mrow>\u0000 <annotation>$$ Mtimes N $$</annotation>\u0000 </semantics></math> samples and variables creates artifacts that have a profound effect on the analysis and interpretation of the data. This work presents an alternative where the raw separations data are analyzed in the frequency domain to account for the offset of the chromatographic peaks as a matrix of complex Fourier coefficients. We present a generalization of the factorization, permutation testing, and visualization steps in ANOVA-simultaneous component analysis (ASCA) to handle complex matrices and use this method to analyze a synthetic dataset with known significant factors and compare the interpretation of a real dataset via its peak table and frequency domain representations.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 2","pages":""},"PeriodicalIF":2.3,"publicationDate":"2025-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143119021","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Greener, Safer, and More Understandable AI for Natural Science and Technology
IF 2.3 4区 化学
Journal of Chemometrics Pub Date : 2025-01-18 DOI: 10.1002/cem.3643
Harald Martens
{"title":"A Greener, Safer, and More Understandable AI for Natural Science and Technology","authors":"Harald Martens","doi":"10.1002/cem.3643","DOIUrl":"https://doi.org/10.1002/cem.3643","url":null,"abstract":"<p>More rational, open-minded use of quantitative Big Data in Science and Technology is required for better real-world problem solving as well as for the stabilization of shared belief structures in society. Modern instrumentation gives informative but overwhelming data streams. A thermal video camera with suitable spatiotemporal subspace modeling allows us to detect surface temperature changes of, for example, engines, that can reveal something going on inside. An RGB video camera responds to both motions and color changes in nature, often with spatiotemporal change patterns that we can discover and describe mathematically, validate statistically, interpret graphically, and then use for sensible things. A hyperspectral Vis./NIR satellite camera with hundreds of wavelengths reveals changes in clouds and at each earth location, again and again. Today we know how to decode such overwhelming streams of high-dimensional data into physical and chemical causalities by minimalistic hybrid multivariate subspace models. We thereby combine prior knowledge with the ability to discover new, reliable variation patterns. Minimalistic subspace models handle such data. These “open-ended” multivariate linear hybrid models are computationally fast, statistically safe, and graphically understandable. The minimalistic subspace models are therefore suitable for both data modeling (based on multivariate measurements) and metamodeling (based on input–output simulation results for nonlinear mechanistic models' behavioral repertoire). That makes it easier to combine high-dimensional streams of real-world measurements and complicated, slow mechanistic models. Implemented as minimalistic foundation models with hierarchies of extended subspace models, this can form a basis for faster discovery and problem solving in Natural Science &amp; Technology.</p>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 2","pages":""},"PeriodicalIF":2.3,"publicationDate":"2025-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cem.3643","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143116402","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信