Pieter Meysman , Justin Barton , Barbara Bravi , Liel Cohen-Lavi , Vadim Karnaukhov , Elias Lilleskov , Alessandro Montemurro , Morten Nielsen , Thierry Mora , Paul Pereira , Anna Postovskaya , María Rodríguez Martínez , Jorge Fernandez-de-Cossio-Diaz , Alexandra Vujkovic , Aleksandra M. Walczak , Anna Weber , Rose Yin , Anne Eugster , Virag Sharma
{"title":"Benchmarking solutions to the T-cell receptor epitope prediction problem: IMMREP22 workshop report","authors":"Pieter Meysman , Justin Barton , Barbara Bravi , Liel Cohen-Lavi , Vadim Karnaukhov , Elias Lilleskov , Alessandro Montemurro , Morten Nielsen , Thierry Mora , Paul Pereira , Anna Postovskaya , María Rodríguez Martínez , Jorge Fernandez-de-Cossio-Diaz , Alexandra Vujkovic , Aleksandra M. Walczak , Anna Weber , Rose Yin , Anne Eugster , Virag Sharma","doi":"10.1016/j.immuno.2023.100024","DOIUrl":null,"url":null,"abstract":"<div><p>Many different solutions to predicting the cognate epitope target of a T-cell receptor (TCR) have been proposed. However several questions on the advantages and disadvantages of these different approaches remain unresolved, as most methods have only been evaluated within the context of their initial publications and data sets. Here, we report the findings of the first public TCR-epitope prediction benchmark performed on 23 prediction models in the context of the ImmRep 2022 TCR-epitope specificity workshop. This benchmark revealed that the use of paired-chain alpha-beta, as well as CDR1/2 or V/J information, when available, improves classification obtained with CDR3 data, independent of the underlying approach. In addition, we found that straight-forward distance-based approaches can achieve a respectable performance when compared to more complex machine-learning models. Finally, we highlight the need for a truly independent follow-up benchmark and provide recommendations for the design of such a next benchmark.</p></div>","PeriodicalId":73343,"journal":{"name":"Immunoinformatics (Amsterdam, Netherlands)","volume":"9 ","pages":"Article 100024"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Immunoinformatics (Amsterdam, Netherlands)","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2667119023000046","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Many different solutions to predicting the cognate epitope target of a T-cell receptor (TCR) have been proposed. However several questions on the advantages and disadvantages of these different approaches remain unresolved, as most methods have only been evaluated within the context of their initial publications and data sets. Here, we report the findings of the first public TCR-epitope prediction benchmark performed on 23 prediction models in the context of the ImmRep 2022 TCR-epitope specificity workshop. This benchmark revealed that the use of paired-chain alpha-beta, as well as CDR1/2 or V/J information, when available, improves classification obtained with CDR3 data, independent of the underlying approach. In addition, we found that straight-forward distance-based approaches can achieve a respectable performance when compared to more complex machine-learning models. Finally, we highlight the need for a truly independent follow-up benchmark and provide recommendations for the design of such a next benchmark.