{"title":"Predicting the Solubility of Lignin via Machine Learning.","authors":"Changhang Zhang, Chenxin Sun, Xinyu Wu, Xiaoyu Li, Yunchang He, Hailan Lian","doi":"10.1021/acs.biomac.5c00874","DOIUrl":null,"url":null,"abstract":"<p><p>Lignin is a highly promising renewable resource, but its practical application faces challenges due to its polydispersity and variability in solubility. This study utilized real-world characterization data (gel permeation chromatography (GPC) and HSQC NMR) to construct the molecular structures of 100 lignins of varying molecular weights. We used a machine learning (ML) approach, combining structural features with quantum chemical information, to predict the solubilities of these lignins in various solvents (calculated using COSMOtherm software). The machine learning model demonstrated high accuracy (<i>R</i><sup>2</sup> values of 0.987, 0.892, and 0.970, respectively), demonstrating its effectiveness in predicting lignin solubility based on structure and solvent properties. Furthermore, SHAP analysis elucidated the influence of individual molecular features on solubility predictions, contributing to our understanding of how the lignin structure influences solubility. This study provides valuable insights into the selection of highly soluble green solvents and the preparation of monodisperse lignin.</p>","PeriodicalId":30,"journal":{"name":"Biomacromolecules","volume":" ","pages":""},"PeriodicalIF":5.4000,"publicationDate":"2025-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomacromolecules","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.biomac.5c00874","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Lignin is a highly promising renewable resource, but its practical application faces challenges due to its polydispersity and variability in solubility. This study utilized real-world characterization data (gel permeation chromatography (GPC) and HSQC NMR) to construct the molecular structures of 100 lignins of varying molecular weights. We used a machine learning (ML) approach, combining structural features with quantum chemical information, to predict the solubilities of these lignins in various solvents (calculated using COSMOtherm software). The machine learning model demonstrated high accuracy (R2 values of 0.987, 0.892, and 0.970, respectively), demonstrating its effectiveness in predicting lignin solubility based on structure and solvent properties. Furthermore, SHAP analysis elucidated the influence of individual molecular features on solubility predictions, contributing to our understanding of how the lignin structure influences solubility. This study provides valuable insights into the selection of highly soluble green solvents and the preparation of monodisperse lignin.
期刊介绍:
Biomacromolecules is a leading forum for the dissemination of cutting-edge research at the interface of polymer science and biology. Submissions to Biomacromolecules should contain strong elements of innovation in terms of macromolecular design, synthesis and characterization, or in the application of polymer materials to biology and medicine.
Topics covered by Biomacromolecules include, but are not exclusively limited to: sustainable polymers, polymers based on natural and renewable resources, degradable polymers, polymer conjugates, polymeric drugs, polymers in biocatalysis, biomacromolecular assembly, biomimetic polymers, polymer-biomineral hybrids, biomimetic-polymer processing, polymer recycling, bioactive polymer surfaces, original polymer design for biomedical applications such as immunotherapy, drug delivery, gene delivery, antimicrobial applications, diagnostic imaging and biosensing, polymers in tissue engineering and regenerative medicine, polymeric scaffolds and hydrogels for cell culture and delivery.