Inequality relations for NMR-based polymer homoblock analysis and extended application: Reanalysis of historical data on alginates, chitosans, homogalacturonans, and galactomannans
{"title":"Inequality relations for NMR-based polymer homoblock analysis and extended application: Reanalysis of historical data on alginates, chitosans, homogalacturonans, and galactomannans","authors":"Xiaohui Xing , Kanglin Xing , Yves S.Y. Hsieh , D. Wade Abbott","doi":"10.1016/j.carres.2024.109189","DOIUrl":null,"url":null,"abstract":"<div><p>There has been a long-standing bottleneck in the quantitative analysis of the frequencies of homoblock polyads beyond triads using <sup>1</sup>H and <sup>13</sup>C NMR for linear polysaccharides, primarily because monosaccharides within a long homoblock share similar chemical environments due to identical neighboring units, resulting in indistinct NMR peaks. In this study, through rigorous mathematical induction, inequality relations were established that enabled the calculation of frequency ranges of homoblock polyads from historically reported NMR-derived frequency values of diads and/or triads of alginates, chitosans, homogalacturonans, and galactomannans. The calculated homoblock frequency ranges were then applied to evaluate three chain growth statistical models, including the Bernoulli chain, first-order Markov chain, and second-order Markov chain, for predicting homoblock frequencies in these polysaccharides. Furthermore, based on the mathematically derived inequality relations, a novel 2D array was constructed, enabling the graphical visualization of homoblock features in polysaccharides. It was demonstrated, as a proof of concept, that the novel 2D array, along with a 1D code generated from it, could serve as an effective feature engineering tool for polymer classification using machine learning algorithms.</p></div>","PeriodicalId":9415,"journal":{"name":"Carbohydrate Research","volume":"542 ","pages":"Article 109189"},"PeriodicalIF":2.4000,"publicationDate":"2024-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Carbohydrate Research","FirstCategoryId":"92","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S000862152400168X","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
There has been a long-standing bottleneck in the quantitative analysis of the frequencies of homoblock polyads beyond triads using 1H and 13C NMR for linear polysaccharides, primarily because monosaccharides within a long homoblock share similar chemical environments due to identical neighboring units, resulting in indistinct NMR peaks. In this study, through rigorous mathematical induction, inequality relations were established that enabled the calculation of frequency ranges of homoblock polyads from historically reported NMR-derived frequency values of diads and/or triads of alginates, chitosans, homogalacturonans, and galactomannans. The calculated homoblock frequency ranges were then applied to evaluate three chain growth statistical models, including the Bernoulli chain, first-order Markov chain, and second-order Markov chain, for predicting homoblock frequencies in these polysaccharides. Furthermore, based on the mathematically derived inequality relations, a novel 2D array was constructed, enabling the graphical visualization of homoblock features in polysaccharides. It was demonstrated, as a proof of concept, that the novel 2D array, along with a 1D code generated from it, could serve as an effective feature engineering tool for polymer classification using machine learning algorithms.
期刊介绍:
Carbohydrate Research publishes reports of original research in the following areas of carbohydrate science: action of enzymes, analytical chemistry, biochemistry (biosynthesis, degradation, structural and functional biochemistry, conformation, molecular recognition, enzyme mechanisms, carbohydrate-processing enzymes, including glycosidases and glycosyltransferases), chemical synthesis, isolation of natural products, physicochemical studies, reactions and their mechanisms, the study of structures and stereochemistry, and technological aspects.
Papers on polysaccharides should have a "molecular" component; that is a paper on new or modified polysaccharides should include structural information and characterization in addition to the usual studies of rheological properties and the like. A paper on a new, naturally occurring polysaccharide should include structural information, defining monosaccharide components and linkage sequence.
Papers devoted wholly or partly to X-ray crystallographic studies, or to computational aspects (molecular mechanics or molecular orbital calculations, simulations via molecular dynamics), will be considered if they meet certain criteria. For computational papers the requirements are that the methods used be specified in sufficient detail to permit replication of the results, and that the conclusions be shown to have relevance to experimental observations - the authors'' own data or data from the literature. Specific directions for the presentation of X-ray data are given below under Results and "discussion".