Boudewijn Hollebrands, Jos Hageman, Hans-Gerd Janssen
{"title":"Application of a Deep Learning Model to Predict Liquid Chromatography Retention Times of Food Peptides Across Chromatographic Conditions","authors":"Boudewijn Hollebrands, Jos Hageman, Hans-Gerd Janssen","doi":"10.1002/jssc.70270","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Comparing predicted and measured retention times can greatly enhance the reliability of peptide identification in LC-MS analysis of smaller, food-derived peptides where MS spectral information alone is often insufficient. Unfortunately, the extensive data sets of peptide retention times from proteomics repositories, or prediction models derived from them, have limited applicability to food-derived peptides due to the structural diversity of these peptides. To address this, we applied a transfer learning approach by fine-tuning a generic deep learning model initially trained on large proteomics datasets using our own experimental data obtained from commercial peptide standards.</p>\n <p>The method utilizes an easy to implement retraining strategy that significantly reduces data requirements and training time compared to building a model from scratch. The retrained model demonstrated strong predictive performance (<i>Q</i><sup>2</sup> > 0.98), and 95% of the retention time predictions of a yeast protein hydrolysate validation set fell within a ±1.0 min window across a wide range of chromatographic conditions, demonstrating both its robustness and practical relevance. We further validated this approach by applying it to the analysis of plant protein hydrolysates. The good performance seen showed its versatility and applicability for diverse sets of peptides including tryptic and non-tryptic peptides.</p>\n <p>Our work underscores the potential of transfer learning in chromatographic analysis, providing an efficient and adaptable tool for rapid and reliable peptide analysis in food research. Transfer learning enabled the utilization of extensive databases from the proteomics area in the much narrower and specialized field of food peptide analysis.</p>\n </div>","PeriodicalId":17098,"journal":{"name":"Journal of separation science","volume":"48 9","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2025-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of separation science","FirstCategoryId":"5","ListUrlMain":"https://analyticalsciencejournals.onlinelibrary.wiley.com/doi/10.1002/jssc.70270","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Comparing predicted and measured retention times can greatly enhance the reliability of peptide identification in LC-MS analysis of smaller, food-derived peptides where MS spectral information alone is often insufficient. Unfortunately, the extensive data sets of peptide retention times from proteomics repositories, or prediction models derived from them, have limited applicability to food-derived peptides due to the structural diversity of these peptides. To address this, we applied a transfer learning approach by fine-tuning a generic deep learning model initially trained on large proteomics datasets using our own experimental data obtained from commercial peptide standards.
The method utilizes an easy to implement retraining strategy that significantly reduces data requirements and training time compared to building a model from scratch. The retrained model demonstrated strong predictive performance (Q2 > 0.98), and 95% of the retention time predictions of a yeast protein hydrolysate validation set fell within a ±1.0 min window across a wide range of chromatographic conditions, demonstrating both its robustness and practical relevance. We further validated this approach by applying it to the analysis of plant protein hydrolysates. The good performance seen showed its versatility and applicability for diverse sets of peptides including tryptic and non-tryptic peptides.
Our work underscores the potential of transfer learning in chromatographic analysis, providing an efficient and adaptable tool for rapid and reliable peptide analysis in food research. Transfer learning enabled the utilization of extensive databases from the proteomics area in the much narrower and specialized field of food peptide analysis.
期刊介绍:
The Journal of Separation Science (JSS) is the most comprehensive source in separation science, since it covers all areas of chromatographic and electrophoretic separation methods in theory and practice, both in the analytical and in the preparative mode, solid phase extraction, sample preparation, and related techniques. Manuscripts on methodological or instrumental developments, including detection aspects, in particular mass spectrometry, as well as on innovative applications will also be published. Manuscripts on hyphenation, automation, and miniaturization are particularly welcome. Pre- and post-separation facets of a total analysis may be covered as well as the underlying logic of the development or application of a method.