Nima Shoghi, Pooya Shoghi, Anuroop Sriram, Abhishek Das
{"title":"分子回归的分布学习","authors":"Nima Shoghi, Pooya Shoghi, Anuroop Sriram, Abhishek Das","doi":"arxiv-2407.20475","DOIUrl":null,"url":null,"abstract":"Using \"soft\" targets to improve model performance has been shown to be\neffective in classification settings, but the usage of soft targets for\nregression is a much less studied topic in machine learning. The existing\nliterature on the usage of soft targets for regression fails to properly assess\nthe method's limitations, and empirical evaluation is quite limited. In this\nwork, we assess the strengths and drawbacks of existing methods when applied to\nmolecular property regression tasks. Our assessment outlines key biases present\nin existing methods and proposes methods to address them, evaluated through\ncareful ablation studies. We leverage these insights to propose Distributional\nMixture of Experts (DMoE): A model-independent, and data-independent method for\nregression which trains a model to predict probability distributions of its\ntargets. Our proposed loss function combines the cross entropy between\npredicted and target distributions and the L1 distance between their expected\nvalues to produce a loss function that is robust to the outlined biases. We\nevaluate the performance of DMoE on different molecular property prediction\ndatasets -- Open Catalyst (OC20), MD17, and QM9 -- across different backbone\nmodel architectures -- SchNet, GemNet, and Graphormer. Our results demonstrate\nthat the proposed method is a promising alternative to classical regression for\nmolecular property prediction tasks, showing improvements over baselines on all\ndatasets and architectures.","PeriodicalId":501266,"journal":{"name":"arXiv - QuanBio - Quantitative Methods","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Distribution Learning for Molecular Regression\",\"authors\":\"Nima Shoghi, Pooya Shoghi, Anuroop Sriram, Abhishek Das\",\"doi\":\"arxiv-2407.20475\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Using \\\"soft\\\" targets to improve model performance has been shown to be\\neffective in classification settings, but the usage of soft targets for\\nregression is a much less studied topic in machine learning. The existing\\nliterature on the usage of soft targets for regression fails to properly assess\\nthe method's limitations, and empirical evaluation is quite limited. In this\\nwork, we assess the strengths and drawbacks of existing methods when applied to\\nmolecular property regression tasks. Our assessment outlines key biases present\\nin existing methods and proposes methods to address them, evaluated through\\ncareful ablation studies. We leverage these insights to propose Distributional\\nMixture of Experts (DMoE): A model-independent, and data-independent method for\\nregression which trains a model to predict probability distributions of its\\ntargets. Our proposed loss function combines the cross entropy between\\npredicted and target distributions and the L1 distance between their expected\\nvalues to produce a loss function that is robust to the outlined biases. We\\nevaluate the performance of DMoE on different molecular property prediction\\ndatasets -- Open Catalyst (OC20), MD17, and QM9 -- across different backbone\\nmodel architectures -- SchNet, GemNet, and Graphormer. Our results demonstrate\\nthat the proposed method is a promising alternative to classical regression for\\nmolecular property prediction tasks, showing improvements over baselines on all\\ndatasets and architectures.\",\"PeriodicalId\":501266,\"journal\":{\"name\":\"arXiv - QuanBio - Quantitative Methods\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-07-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - QuanBio - Quantitative Methods\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2407.20475\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Quantitative Methods","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2407.20475","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Using "soft" targets to improve model performance has been shown to be
effective in classification settings, but the usage of soft targets for
regression is a much less studied topic in machine learning. The existing
literature on the usage of soft targets for regression fails to properly assess
the method's limitations, and empirical evaluation is quite limited. In this
work, we assess the strengths and drawbacks of existing methods when applied to
molecular property regression tasks. Our assessment outlines key biases present
in existing methods and proposes methods to address them, evaluated through
careful ablation studies. We leverage these insights to propose Distributional
Mixture of Experts (DMoE): A model-independent, and data-independent method for
regression which trains a model to predict probability distributions of its
targets. Our proposed loss function combines the cross entropy between
predicted and target distributions and the L1 distance between their expected
values to produce a loss function that is robust to the outlined biases. We
evaluate the performance of DMoE on different molecular property prediction
datasets -- Open Catalyst (OC20), MD17, and QM9 -- across different backbone
model architectures -- SchNet, GemNet, and Graphormer. Our results demonstrate
that the proposed method is a promising alternative to classical regression for
molecular property prediction tasks, showing improvements over baselines on all
datasets and architectures.