{"title":"Advancements in Molecular Property Prediction: A Survey of Single and Multimodal Approaches","authors":"Tanya Liyaqat, Tanvir Ahmad, Chandni Saxena","doi":"arxiv-2408.09461","DOIUrl":null,"url":null,"abstract":"Molecular Property Prediction (MPP) plays a pivotal role across diverse\ndomains, spanning drug discovery, material science, and environmental\nchemistry. Fueled by the exponential growth of chemical data and the evolution\nof artificial intelligence, recent years have witnessed remarkable strides in\nMPP. However, the multifaceted nature of molecular data, such as molecular\nstructures, SMILES notation, and molecular images, continues to pose a\nfundamental challenge in its effective representation. To address this,\nrepresentation learning techniques are instrumental as they acquire informative\nand interpretable representations of molecular data. This article explores\nrecent AI/-based approaches in MPP, focusing on both single and multiple\nmodality representation techniques. It provides an overview of various molecule\nrepresentations and encoding schemes, categorizes MPP methods by their use of\nmodalities, and outlines datasets and tools available for feature generation.\nThe article also analyzes the performance of recent methods and suggests future\nresearch directions to advance the field of MPP.","PeriodicalId":501022,"journal":{"name":"arXiv - QuanBio - Biomolecules","volume":"39 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Biomolecules","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.09461","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Molecular Property Prediction (MPP) plays a pivotal role across diverse
domains, spanning drug discovery, material science, and environmental
chemistry. Fueled by the exponential growth of chemical data and the evolution
of artificial intelligence, recent years have witnessed remarkable strides in
MPP. However, the multifaceted nature of molecular data, such as molecular
structures, SMILES notation, and molecular images, continues to pose a
fundamental challenge in its effective representation. To address this,
representation learning techniques are instrumental as they acquire informative
and interpretable representations of molecular data. This article explores
recent AI/-based approaches in MPP, focusing on both single and multiple
modality representation techniques. It provides an overview of various molecule
representations and encoding schemes, categorizes MPP methods by their use of
modalities, and outlines datasets and tools available for feature generation.
The article also analyzes the performance of recent methods and suggests future
research directions to advance the field of MPP.