{"title":"T- Hop:研究分子图中重要路径信息以预测化学性质的框架","authors":"Abdulrahman Ibraheem, Narsis Kiani, Jesper Tegner","doi":"arxiv-2407.14270","DOIUrl":null,"url":null,"abstract":"This paper studies the usefulness of incorporating path information in\npredicting chemical properties from molecular graphs, in the domain of QSAR\n(Quantitative Structure-Activity Relationship). Towards this, we developed a\nGNN-style model which can be toggled to operate in one of two modes: a\nnon-degenerate mode which incorporates path information, and a degenerate mode\nwhich leaves out path information. Thus, by comparing the performance of the\nnon-degenerate mode versus the degenerate mode on relevant QSAR datasets, we\nwere able to directly assess the significance of path information on those\ndatasets. Our results corroborate previous works, by suggesting that the\nusefulness of path information is datasetdependent. Unlike previous studies\nhowever, we took the very first steps towards building a model that could\npredict upfront whether or not path information would be useful for a given\ndataset at hand. Moreover, we also found that, albeit its simplicity, the\ndegenerate mode of our model yielded rather surprising results, which\noutperformed more sophisticated SOTA models in certain cases.","PeriodicalId":501022,"journal":{"name":"arXiv - QuanBio - Biomolecules","volume":"129 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"T- Hop: A framework for studying the importance path information in molecular graphs for chemical property prediction\",\"authors\":\"Abdulrahman Ibraheem, Narsis Kiani, Jesper Tegner\",\"doi\":\"arxiv-2407.14270\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper studies the usefulness of incorporating path information in\\npredicting chemical properties from molecular graphs, in the domain of QSAR\\n(Quantitative Structure-Activity Relationship). Towards this, we developed a\\nGNN-style model which can be toggled to operate in one of two modes: a\\nnon-degenerate mode which incorporates path information, and a degenerate mode\\nwhich leaves out path information. Thus, by comparing the performance of the\\nnon-degenerate mode versus the degenerate mode on relevant QSAR datasets, we\\nwere able to directly assess the significance of path information on those\\ndatasets. Our results corroborate previous works, by suggesting that the\\nusefulness of path information is datasetdependent. Unlike previous studies\\nhowever, we took the very first steps towards building a model that could\\npredict upfront whether or not path information would be useful for a given\\ndataset at hand. Moreover, we also found that, albeit its simplicity, the\\ndegenerate mode of our model yielded rather surprising results, which\\noutperformed more sophisticated SOTA models in certain cases.\",\"PeriodicalId\":501022,\"journal\":{\"name\":\"arXiv - QuanBio - Biomolecules\",\"volume\":\"129 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-06-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - QuanBio - Biomolecules\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2407.14270\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Biomolecules","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2407.14270","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
本文研究了在 QSAR(定量结构-活性关系)领域结合路径信息从分子图中预测化学性质的有用性。为此,我们开发了一种 GNN 式模型,该模型可切换为两种运行模式之一:一种是包含路径信息的非退化模式,另一种是不包含路径信息的退化模式。因此,通过比较非退化模式和退化模式在相关 QSAR 数据集上的性能,我们能够直接评估路径信息在这些数据集上的重要性。我们的研究结果表明,路径信息的有用性与数据集有关,从而证实了之前的研究结果。但与以往研究不同的是,我们迈出了第一步,建立了一个可以事先预测路径信息对给定数据集是否有用的模型。此外,我们还发现,尽管我们的模型很简单,但其退化模式却产生了相当令人惊讶的结果,在某些情况下,其表现优于更复杂的 SOTA 模型。
T- Hop: A framework for studying the importance path information in molecular graphs for chemical property prediction
This paper studies the usefulness of incorporating path information in
predicting chemical properties from molecular graphs, in the domain of QSAR
(Quantitative Structure-Activity Relationship). Towards this, we developed a
GNN-style model which can be toggled to operate in one of two modes: a
non-degenerate mode which incorporates path information, and a degenerate mode
which leaves out path information. Thus, by comparing the performance of the
non-degenerate mode versus the degenerate mode on relevant QSAR datasets, we
were able to directly assess the significance of path information on those
datasets. Our results corroborate previous works, by suggesting that the
usefulness of path information is datasetdependent. Unlike previous studies
however, we took the very first steps towards building a model that could
predict upfront whether or not path information would be useful for a given
dataset at hand. Moreover, we also found that, albeit its simplicity, the
degenerate mode of our model yielded rather surprising results, which
outperformed more sophisticated SOTA models in certain cases.