{"title":"T- Hop: A framework for studying the importance path information in molecular graphs for chemical property prediction","authors":"Abdulrahman Ibraheem, Narsis Kiani, Jesper Tegner","doi":"arxiv-2407.14270","DOIUrl":null,"url":null,"abstract":"This paper studies the usefulness of incorporating path information in\npredicting chemical properties from molecular graphs, in the domain of QSAR\n(Quantitative Structure-Activity Relationship). Towards this, we developed a\nGNN-style model which can be toggled to operate in one of two modes: a\nnon-degenerate mode which incorporates path information, and a degenerate mode\nwhich leaves out path information. Thus, by comparing the performance of the\nnon-degenerate mode versus the degenerate mode on relevant QSAR datasets, we\nwere able to directly assess the significance of path information on those\ndatasets. Our results corroborate previous works, by suggesting that the\nusefulness of path information is datasetdependent. Unlike previous studies\nhowever, we took the very first steps towards building a model that could\npredict upfront whether or not path information would be useful for a given\ndataset at hand. Moreover, we also found that, albeit its simplicity, the\ndegenerate mode of our model yielded rather surprising results, which\noutperformed more sophisticated SOTA models in certain cases.","PeriodicalId":501022,"journal":{"name":"arXiv - QuanBio - Biomolecules","volume":"129 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Biomolecules","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2407.14270","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper studies the usefulness of incorporating path information in
predicting chemical properties from molecular graphs, in the domain of QSAR
(Quantitative Structure-Activity Relationship). Towards this, we developed a
GNN-style model which can be toggled to operate in one of two modes: a
non-degenerate mode which incorporates path information, and a degenerate mode
which leaves out path information. Thus, by comparing the performance of the
non-degenerate mode versus the degenerate mode on relevant QSAR datasets, we
were able to directly assess the significance of path information on those
datasets. Our results corroborate previous works, by suggesting that the
usefulness of path information is datasetdependent. Unlike previous studies
however, we took the very first steps towards building a model that could
predict upfront whether or not path information would be useful for a given
dataset at hand. Moreover, we also found that, albeit its simplicity, the
degenerate mode of our model yielded rather surprising results, which
outperformed more sophisticated SOTA models in certain cases.