Andreas Tosstorff, Markus G Rudolph, Jörg Benz, Bernd Kuhn, Christian Kramer, May Sharpe, Chia-Ying Huang, Alexander Metz, Julien Hazemann, Daniel Ritz, Aengus Mac Sweeney, Michael K Gilson
{"title":"casp16实验蛋白配体数据集。","authors":"Andreas Tosstorff, Markus G Rudolph, Jörg Benz, Bernd Kuhn, Christian Kramer, May Sharpe, Chia-Ying Huang, Alexander Metz, Julien Hazemann, Daniel Ritz, Aengus Mac Sweeney, Michael K Gilson","doi":"10.1002/prot.70053","DOIUrl":null,"url":null,"abstract":"<p><p>This paper presents the experimental protein-ligand datasets used as benchmarks in the CASP 16 blind prediction experiment-the first CASP round to incorporate targets from pharmaceutical discovery projects. We have assembled and characterized protein-ligand complexes for four proteins that are known or candidate drug targets: human chymase, human cathepsin G, human autotaxin, and the SARS-CoV-2 main protease. The collection encompasses over 200 co-crystal structures at resolutions better than 2.7 Å, paired with binding affinity measurements for approximately 160 compounds covering a broad affinity range (nanomolar to high micromolar). These data enabled the CASP16 pose-prediction and affinity-prediction challenges. Many systems feature potentially challenging characteristics, including chymase's electropositive surface and acidic ligands, which require proper handling of titratable ligand groups; autotaxin complexes with and without zinc coordination; and a SARS-CoV-2 protease crystal form exhibiting an unusually open active site conformation. We describe the experimental approaches-from protein production and crystallization to binding assay development-that yielded these reference data. Contributed by scientists at F. Hoffmann-La Roche and Idorsia Pharmaceuticals, these datasets represent actual drug discovery projects and therefore provide a realistic testbed for assessing how computational methods perform on pharmaceutically relevant targets. An accompanying paper in the present special journal issue provides a comprehensive assessment of the pose and affinity predictions for these pharmaceutical protein-ligand systems.</p>","PeriodicalId":56271,"journal":{"name":"Proteins-Structure Function and Bioinformatics","volume":" ","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2025-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"The CASP 16 Experimental Protein-Ligand Datasets.\",\"authors\":\"Andreas Tosstorff, Markus G Rudolph, Jörg Benz, Bernd Kuhn, Christian Kramer, May Sharpe, Chia-Ying Huang, Alexander Metz, Julien Hazemann, Daniel Ritz, Aengus Mac Sweeney, Michael K Gilson\",\"doi\":\"10.1002/prot.70053\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>This paper presents the experimental protein-ligand datasets used as benchmarks in the CASP 16 blind prediction experiment-the first CASP round to incorporate targets from pharmaceutical discovery projects. We have assembled and characterized protein-ligand complexes for four proteins that are known or candidate drug targets: human chymase, human cathepsin G, human autotaxin, and the SARS-CoV-2 main protease. The collection encompasses over 200 co-crystal structures at resolutions better than 2.7 Å, paired with binding affinity measurements for approximately 160 compounds covering a broad affinity range (nanomolar to high micromolar). These data enabled the CASP16 pose-prediction and affinity-prediction challenges. Many systems feature potentially challenging characteristics, including chymase's electropositive surface and acidic ligands, which require proper handling of titratable ligand groups; autotaxin complexes with and without zinc coordination; and a SARS-CoV-2 protease crystal form exhibiting an unusually open active site conformation. We describe the experimental approaches-from protein production and crystallization to binding assay development-that yielded these reference data. Contributed by scientists at F. Hoffmann-La Roche and Idorsia Pharmaceuticals, these datasets represent actual drug discovery projects and therefore provide a realistic testbed for assessing how computational methods perform on pharmaceutically relevant targets. An accompanying paper in the present special journal issue provides a comprehensive assessment of the pose and affinity predictions for these pharmaceutical protein-ligand systems.</p>\",\"PeriodicalId\":56271,\"journal\":{\"name\":\"Proteins-Structure Function and Bioinformatics\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2025-10-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proteins-Structure Function and Bioinformatics\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1002/prot.70053\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proteins-Structure Function and Bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1002/prot.70053","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
This paper presents the experimental protein-ligand datasets used as benchmarks in the CASP 16 blind prediction experiment-the first CASP round to incorporate targets from pharmaceutical discovery projects. We have assembled and characterized protein-ligand complexes for four proteins that are known or candidate drug targets: human chymase, human cathepsin G, human autotaxin, and the SARS-CoV-2 main protease. The collection encompasses over 200 co-crystal structures at resolutions better than 2.7 Å, paired with binding affinity measurements for approximately 160 compounds covering a broad affinity range (nanomolar to high micromolar). These data enabled the CASP16 pose-prediction and affinity-prediction challenges. Many systems feature potentially challenging characteristics, including chymase's electropositive surface and acidic ligands, which require proper handling of titratable ligand groups; autotaxin complexes with and without zinc coordination; and a SARS-CoV-2 protease crystal form exhibiting an unusually open active site conformation. We describe the experimental approaches-from protein production and crystallization to binding assay development-that yielded these reference data. Contributed by scientists at F. Hoffmann-La Roche and Idorsia Pharmaceuticals, these datasets represent actual drug discovery projects and therefore provide a realistic testbed for assessing how computational methods perform on pharmaceutically relevant targets. An accompanying paper in the present special journal issue provides a comprehensive assessment of the pose and affinity predictions for these pharmaceutical protein-ligand systems.
期刊介绍:
PROTEINS : Structure, Function, and Bioinformatics publishes original reports of significant experimental and analytic research in all areas of protein research: structure, function, computation, genetics, and design. The journal encourages reports that present new experimental or computational approaches for interpreting and understanding data from biophysical chemistry, structural studies of proteins and macromolecular assemblies, alterations of protein structure and function engineered through techniques of molecular biology and genetics, functional analyses under physiologic conditions, as well as the interactions of proteins with receptors, nucleic acids, or other specific ligands or substrates. Research in protein and peptide biochemistry directed toward synthesizing or characterizing molecules that simulate aspects of the activity of proteins, or that act as inhibitors of protein function, is also within the scope of PROTEINS. In addition to full-length reports, short communications (usually not more than 4 printed pages) and prediction reports are welcome. Reviews are typically by invitation; authors are encouraged to submit proposed topics for consideration.