Mattia Sozzi , Nicola Cavallini , Alessandro Chiadò , Gentian Gavoci , Enrico Cantaluppi , Filip Haxhari , Francesco Savorani
{"title":"A new versatile algorithm to extract particle’s features from FESEM images: method evaluation and a case study on rice kernels","authors":"Mattia Sozzi , Nicola Cavallini , Alessandro Chiadò , Gentian Gavoci , Enrico Cantaluppi , Filip Haxhari , Francesco Savorani","doi":"10.1016/j.chemolab.2025.105415","DOIUrl":"10.1016/j.chemolab.2025.105415","url":null,"abstract":"<div><div>Image analysis approaches allow to quickly extract important information from images of diverse nature. Many techniques produce as a result images that contain regular and irregular objects. The ability of automatically extracting the objects and their related morphological features and properties is becoming fundamental, especially when the number of images to analyse is consistent.</div><div>In this context, a new algorithm able to extract a series of morphological features from FESEM images was developed. Starting from a case study on 54 varieties of rice kernels, 220 images were acquired, and the algorithm was coded with the aim of extracting information from the round-shaped starch particles naturally present in rice kernels. The algorithm constitutes of different steps to segment the images and identify the object shapes and boundaries. Once those objects are identified, the algorithm extracts their morphological features, the number of identified objects and the amount of empty spaces among those objects.</div><div>The developed algorithm is suitable for a rapid and automated analysis of several images, with the aim of extracting object-related morphological features and information about the general objects space disposition. The use of adaptive thresholds and correction steps allow to analyse images of different natures containing also defective and non-representative objects that will be automatically removed from the features calculation. In addition, to evaluate the algorithm performances, a Design of Experiment approach was developed to determine the effect of the input parameters choice on the algorithm output results, highlighting which parameters show a stronger effect on the output.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"263 ","pages":"Article 105415"},"PeriodicalIF":3.7,"publicationDate":"2025-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143887709","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Feyyaz Alpsalaz , Yıldırım Özüpak , Emrah Aslan , Hasan Uzel
{"title":"Classification of maize leaf diseases with deep learning: Performance evaluation of the proposed model and use of explicable artificial intelligence","authors":"Feyyaz Alpsalaz , Yıldırım Özüpak , Emrah Aslan , Hasan Uzel","doi":"10.1016/j.chemolab.2025.105412","DOIUrl":"10.1016/j.chemolab.2025.105412","url":null,"abstract":"<div><div>Maize leaf diseases pose significant threats to global agricultural productivity, yet traditional diagnostic methods are slow, subjective, and resource-intensive. This study proposes a lightweight and interpretable convolutional neural network (CNN) model for accurate and efficient classification of maize leaf diseases. Using the ‘Corn or Maize Leaf Disease Dataset’, the model classifies four disease categories Healthy, Gray Leaf Spot, Common Rust, and Northern Leaf Blight with 94.97 % accuracy and a micro-average AUC of 0.99. With only 1.22 million parameters, the model supports real-time inference on mobile devices, making it ideal for field applications. Data augmentation and transfer learning techniques were applied to ensure robust generalization. To enhance transparency and user trust, Explainable Artificial Intelligence (XAI) methods, including LIME and SHAP, were employed to identify disease-relevant features such as lesions and pustules, with SHAP achieving an IoU of 0.82. The proposed model outperformed benchmark models like ResNet50, MobileNetV2, and EfficientNetB0 in both accuracy and computational efficiency. Robustness tests under simulated environmental challenges confirmed its adaptability, with only a 2.82 % performance drop under extreme conditions. Comparative analyses validated its statistical significance and practical superiority. This model represents a reliable, fast, and explainable solution for precision agriculture, especially in resource-constrained environments. Future enhancements will include multi-angle imaging, multimodal inputs, and extended datasets to improve adaptability and scalability in real-world conditions.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"262 ","pages":"Article 105412"},"PeriodicalIF":3.7,"publicationDate":"2025-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143874963","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Abhinav Abraham , Hadis Anahideh , Eric Mayhew , Kenneth Brezinsky , Patrick T. Lynch
{"title":"Evaluation of n-component surrogate mixtures formulated for jet fuel physicochemical property predictions","authors":"Abhinav Abraham , Hadis Anahideh , Eric Mayhew , Kenneth Brezinsky , Patrick T. Lynch","doi":"10.1016/j.chemolab.2025.105409","DOIUrl":"10.1016/j.chemolab.2025.105409","url":null,"abstract":"<div><div>Compression ignition (CI) engines operating on aviation fuels are sensitive to fuel property variation but can maintain robust ignition with inline, feedforward sensing and control. Due to the expensive and time-consuming nature of conventional fuel property testing methods, alternate methods like Quantitative Structure Property Relationship (QSPR) models have been developed, which link fuel properties to their molecular structures, but many fail to account for the complex intermolecular interactions in multicomponent mixtures like jet fuels. Recent studies have employed machine learning (ML) models trained on surrogate mixtures of jet fuels to predict fuel properties like the derived cetane number, density, and viscosity; and they show good performance. These surrogate mixtures, composed of hydrocarbons representing real fuel components, facilitate dataset generation for ML model training. However, the sufficiency of such datasets has not been studied exhaustively. This study investigates the sufficiency of datasets composed of jet fuel surrogate mixtures in predicting fuel properties for fuels beyond the training set. While dataset sufficiency does depend on the population and the property modeled, as a heuristic, we emphasize spanning the chemical functional groups present in likely mixtures and doing this with low correlation and high diversity in the chemical functional groups. A large dataset of surrogate mixtures was generated spanning the ranges of UNIversal Functional Activity Coefficients (UNIFAC) functional groups present in real jet fuels. This study establishes criteria for dataset sufficiency, suggesting a minimum dataset size for developing a robust ML model. We also report the tools and codes developed for determining the minimum number of surrogate mixtures needed for the purpose of training ML models given user defined UNIFAC functional group compositions and a palette of suitable neat hydrocarbon components. This research contributes to developing efficient, accurate ML models for predicting the properties of complex fuel mixtures, with applications in fuel formulation, sensing, and control.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"263 ","pages":"Article 105409"},"PeriodicalIF":3.7,"publicationDate":"2025-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143895185","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Visualized correlation and distance preserving dimensionality reduction method","authors":"Zhonghai He , Zhanbo Feng , Haoxiang Zhang , Xiaofang Zhang","doi":"10.1016/j.chemolab.2025.105406","DOIUrl":"10.1016/j.chemolab.2025.105406","url":null,"abstract":"<div><div>A large of existing dimensionality reduction methods are aimed at preserving some properties of data, which cannot take label information into account. With the aim of reduced low-dimensional coordinate is used as tool for timing judgment of model updating, the concentration information should be incorporated into dimensionality reduction procedure, which is presented and named as Visualized Correlation and Distance Preserving dimensionality reduction method. To address the difficulty of 2D coordinate and 1D label correlation computation, pairwise distance matrices in both the subspace and label space are computed and the strictly lower triangular parts of these matrices are extracted and vectorized in column-major order, resulting in two vectors so that correlation can be computed. Distance preservation term is included as sub-objective function to ensure the low distance dissimilarity between high and low coordinates. To reduce structural loss caused by sequential dimensionality reduction method, the projection matrix is concatenated to vector then optimized to ensure projection vectors are optimized synchronously. PCA transformation is continued to adjust the reduced coordinates to better suited for visual judgment.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"262 ","pages":"Article 105406"},"PeriodicalIF":3.7,"publicationDate":"2025-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143850369","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Generalized continuum regression (GCR): An advanced multivariate method for precise dimensionality reduction and efficient regression modeling","authors":"Yang Chen , Chonghui Dan , Yao He, Xiaoyuan Zheng","doi":"10.1016/j.chemolab.2025.105407","DOIUrl":"10.1016/j.chemolab.2025.105407","url":null,"abstract":"<div><div>The collinearity inherent in high-dimensional, low-sample-size (HDLSS) data critically undermines the accuracy of chemometric regression. We have proven that when predictor matrix <span><math><mrow><mi>X</mi></mrow></math></span> has full row rank, the optimal dimensionality of <em>X</em>-block latent variables (LVs) in multivariate linear regression equals <span><math><mrow><mi>r</mi><mi>a</mi><mi>n</mi><mi>k</mi><mrow><mo>(</mo><mi>Y</mi><mo>)</mo></mrow></mrow></math></span>. Based on this theoretical foundation, we develop generalized continuum regression (GCR), an advanced multivariate regression method rooted in continuum canonical correlation (CCC). GCR's core innovation lies in the extension of CCC's scalar parameter <span><math><mrow><mi>α</mi></mrow></math></span> to a vector form for precise dimension reduction in multivariate regression. We also develop an efficient numeric algorithm for computational speed. Real-world implementations on two spectroscopic datasets confirm that GCR adopts LVs with dimensionality equal to the rank of <span><math><mrow><mi>Y</mi></mrow></math></span>, validating its precise dimensionality reduction. When compared to CCC regression (CCCR), GCR exhibits superior performance with: (1) a 7.28 %–43.70 % reduction in mean-squared error for validation (MSEV) when utilizing two or three latent variables (LVs); and (2) a 30 to 55-fold increase in solution speed. These findings highlight GCR's potential as a valuable tool for dimensionality reduction and regression modeling in chemometrics, specifically for HDLSS analysis.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"262 ","pages":"Article 105407"},"PeriodicalIF":3.7,"publicationDate":"2025-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143878346","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Walter M. Warren-Vega , Sofia Cornejo-León , Ana I. Zárate-Guzmán , Francisco Carrasco-Marín , Luis A. Romero-Cano
{"title":"Chemometric modeling of the adsorption mechanism of Cu(II) in aqueous solution onto functionalized materials: Integrating artificial neural networks and porous structure characterization","authors":"Walter M. Warren-Vega , Sofia Cornejo-León , Ana I. Zárate-Guzmán , Francisco Carrasco-Marín , Luis A. Romero-Cano","doi":"10.1016/j.chemolab.2025.105405","DOIUrl":"10.1016/j.chemolab.2025.105405","url":null,"abstract":"<div><div>Traditional methods for evaluating adsorption mechanisms rely on material characterization and its linear relationship with adsorption capacity. However, this approach has limitations, as it assumes a linear correlation, and when this fails, it is often speculated that multiple mechanisms are involved without detailing their contributions. This study overcomes these challenges by using artificial intelligence to analyze the adsorption of Cu(II) onto alternative adsorbents. An Artificial Neural Network (ANN) combined with 3D porous texture simulations, based on mercury intrusion porosimetry, established non-linear correlations among 13 textural and chemical characteristics and adsorption capacity.</div><div>The material with the highest adsorption capacity (107 mg g<sup>−1</sup>) featured an accessible porous texture rich in –COOH groups. The ANN quantified the contributions of two governing mechanisms: diffusion through the porous texture (67.07 %) and interaction with –COOH sites (32.93 %). Chemometric analysis revealed that the greatest weight in the ANN model was attributed to the average pore diameter (17.11 %), which was consistent with the characterization of the saturated material by SEM-EDX, showing that adsorption occurs primarily in the exposed cavities of the material.</div><div>The adsorption mechanism proposed by the ANN study explains the atypical points observed in the different materials, showing that the adsorption process is governed by a combination of two mechanisms: one associated with the porous texture and the other with surface chemistry. The findings provide a deeper understanding of the key variables influencing adsorption and offer guidance for optimizing material synthesis.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"262 ","pages":"Article 105405"},"PeriodicalIF":3.7,"publicationDate":"2025-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143844324","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Zeyu Hou , Bingxin Yan , Yuhan Zhao , Shengbo Zhang , Bo Su , Kai Li , Cunlin Zhang
{"title":"Spectral investigation of aspartame and acesulfame utilizing PXRD, Raman, FTIR, and THz technologies","authors":"Zeyu Hou , Bingxin Yan , Yuhan Zhao , Shengbo Zhang , Bo Su , Kai Li , Cunlin Zhang","doi":"10.1016/j.chemolab.2025.105408","DOIUrl":"10.1016/j.chemolab.2025.105408","url":null,"abstract":"<div><div>Artificial sweeteners, as a type of food additive, have vibration frequencies mostly concentrated in the terahertz (THz) band, which enables us to utilize THz technology to deeply analyze their molecular properties. To gain a more comprehensive understanding of the features of artificial sweeteners, this study specifically chose aspartame and acesulfame as representatives. Initially, we employed PXRD and Raman spectroscopy techniques to carry out a thorough examination and verification of the crystalline structure as well as the purity levels of these two synthetic sweeteners. Then, with the aid of Fourier transform infrared spectroscopy (FTIR) technology and terahertz time-domain spectroscopy (THz-TDS) system, the spectral characteristics of aspartame and acesulfame were precisely measured. Moreover, the crystal configurations of these two artificial sweeteners were simulated using solid-state density functional theory (DFT), and the simulation results were in good agreement with the experimental results, further validating the effectiveness of our research methods. Finally, using microfluidic chip technology, the THz spectral characteristics of aspartame and acesulfame in solution were determined, and were compared with their spectra in solid state. We found that the THz spectra of the two artificial sweeteners in solid and solution states have significant correlations. In addition, the research further elucidated that the THz spectrum of a substance dissolved in a solution exhibits a close correlation with its concentration within that solution. These findings provide new perspectives and value for our in-depth research on artificial sweeteners.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"262 ","pages":"Article 105408"},"PeriodicalIF":3.7,"publicationDate":"2025-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143838713","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Optimizing soft sensor costs through feature selection: A comparative study of sensory and chemical parameters in wine grade prediction","authors":"Jingxian An , Zhipeng Zhang","doi":"10.1016/j.chemolab.2025.105404","DOIUrl":"10.1016/j.chemolab.2025.105404","url":null,"abstract":"<div><div>Traditional wine grade evaluation, typically conducted by world-renowned wine experts, was found to disadvantage emerging wineries due to its restrictive and time-consuming nature. This study proposed an alternative approach using soft sensors to predict wine grades, investigating the cost-effectiveness of both chemical and sensory evaluation methods through various machine learning approaches. A dataset of 23 unique wine samples in duplicate (totaling 46 bottles of New Zealand Pinot Noir wines), classified across all five stars of the Jukes-Stelzer system, was analyzed using 13 chemical parameters and 35 sensory attributes. The research employed classification algorithms, including naïve Bayes, k-nearest neighbors, decision trees, and support vector machines, to predict wine grades. Additionally, multiple feature selection methods—such as PCA distance analysis, ensemble tree-based feature selection, decision tree-based feature selection, Fisher score, relief-F score analysis, and one-way ANOVA—were used to identify the most significant predictive variables while minimizing analytical costs. Results demonstrated that chemical parameters, particularly those related to wine color and total phenolics, served as strong indicators of wine grade, with soft sensors using all 13 chemical parameters achieving prediction accuracies up to 93.48 %. Sensory attributes, particularly oak influence and tertiary aromas related to wine storage, also proved to be effective predictors. Soft sensors utilizing all 35 sensory attributes achieved accuracies of 97.83 %. Through feature selection methods, costs could be reduced by up to 100 % while maintaining acceptable prediction accuracy (above 65 %). Similarly, accuracies above 65 % were achieved using sensory attributes as input data, alongside a 97 % cost reduction. Additionally, in scenarios where chemical measurements were taken only once and sensory attributes were evaluated by a single wine expert, a comparative cost analysis revealed that sensory attributes were more economical for high-accuracy predictions (>70 %), while chemical parameters proved more cost-effective for moderate accuracy levels (<70 %). For higher accuracy requirements (>70 %), sensory evaluation emerged as the optimal choice, offering both high accuracy and cost-effectiveness. This study proposed a practical framework for cost-effective wine grade prediction methods that could benefit both established and emerging wine producers, offering an accessible alternative to traditional expert-based evaluation systems.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"262 ","pages":"Article 105404"},"PeriodicalIF":3.7,"publicationDate":"2025-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143858815","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Miguel de Figueiredo , Serge Rudaz , Julien Boccard
{"title":"Integration of multifactorial omics data from several sources using multiblock methods","authors":"Miguel de Figueiredo , Serge Rudaz , Julien Boccard","doi":"10.1016/j.chemolab.2025.105403","DOIUrl":"10.1016/j.chemolab.2025.105403","url":null,"abstract":"<div><div>With advances in data acquisition methods and technical platforms, omics measurement collection yields increasingly complex data structures. While high-dimensional matrices with more variables than samples can be handled via multivariate methods, extracting information is more challenging in the case of experimental designs involving several factors. Multifactorial models combining ANOVA and multivariate approaches have been developed for this purpose, but analyzing unbalanced designs remains challenging, especially when several data blocks are integrated.</div><div>This study introduces integrative AComDim (iAComDim) and integrative AMOPLS (iAMOPLS) for the analysis of multifactorial data from multiple sources. These methods implement a rebalancing strategy tailored for multiblock settings, ensuring unbiased effect estimators and orthogonal effect matrices even with unbalanced designs. When applied to a multiomics benchmark dataset with two experimental factors, these approaches effectively separate the sources of variation related to the effects in the design while summarizing information into a single multiblock model. Rebalancing strategies prevent the mixing of variation sources in extracted components, and their integration with multiblock chemometric methods offers an efficient and versatile solution for analyzing complex data structures.</div><div>This work establishes a novel framework for analyzing data from single or multiple sources within multifactorial experimental designs. Furthermore, the proposed methods are flexible enough to analyze unbalanced designs with heterogeneously missing replicates across multiple tables, making them broadly applicable for handling multiomics or other datasets in various application domains.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"262 ","pages":"Article 105403"},"PeriodicalIF":3.7,"publicationDate":"2025-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143833935","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"GAN-ML: Advancing anticancer peptide prediction through innovative Deep Convolution Generative Adversarial Network data augmentation technique","authors":"Sadik Bhattarai , Kil To Chong , Hilal Tayara","doi":"10.1016/j.chemolab.2025.105390","DOIUrl":"10.1016/j.chemolab.2025.105390","url":null,"abstract":"<div><div>Limited and imbalanced data hinder anticancer peptide (ACP) prediction, often resulting in over-fitting and poor performance on unseen peptides. To address these challenges, we propose a Deep Convolution Generative Adversarial Network (DC-GAN) based data augmentation method. This approach effectively expands the training dataset by generating peptides with anticancer properties, particularly underrepresented class such as N+ type ACPs, characterized by abundant positive residues in the N-terminus, which remain amnesic problem in anticancer peptide prediction. Compared to traditional methods like Synthetic Minority Over-sampling Technique (SMOTE) and SMOTE with Edited Nearest Neighbors (SMOTEENN), DC-GAN demonstrates superior performance by addressing both limited training samples and within-class imbalances, such as those between C+ and N+ type peptides. The proposed framework, GAN-ML cascade a linear model and an ensemble model, achieving accuracy rates of 82.96% (independent test), 96.06% (independent test), and 94.06% (5-fold cross-validation) for classifying peptides as anticancer, antimicrobial, or non-anticancer across various datasets integrating ACPs motif based authentication and physio-chemical properties based validation. These results highlight the efficacy of DC-GAN-based data augmentation in enhancing model generalization, improving performance by generating a samples with minority representation, and serving as a powerful tool for generative anticancer drug discovery.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"262 ","pages":"Article 105390"},"PeriodicalIF":3.7,"publicationDate":"2025-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143816471","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}