Aftab Farrukh, Ibrahim A Shaaban, Mohammed A Assiri, Mudassir Hussain Tahir, Zeinhom M El-Bahy
{"title":"预测水溶性有机化合物的紫外/可见吸收最大值并生成新的有机化合物库。","authors":"Aftab Farrukh, Ibrahim A Shaaban, Mohammed A Assiri, Mudassir Hussain Tahir, Zeinhom M El-Bahy","doi":"10.1016/j.saa.2024.125453","DOIUrl":null,"url":null,"abstract":"<p><p>In this study, UV/visible absorption maxima of organic compounds are predicted with the help of machine learning (ML). Four ML models are evaluated, the gradient boosting model has performed best. We also analyzed feature importance. Using Python-based tools, we generated and visualized a new set of 5,000 organic compounds. These compounds were screened based on their predicted UV/visible absorption maxima, selecting those with red-shifted absorption. The assessment of synthetic accessibility indicated that most of the chosen compounds are relatively easy to synthesize.</p>","PeriodicalId":94213,"journal":{"name":"Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy","volume":"328 ","pages":"125453"},"PeriodicalIF":0.0000,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"UV/visible absorption maxima prediction of water-soluble organic compounds and generation of library of new organic compounds.\",\"authors\":\"Aftab Farrukh, Ibrahim A Shaaban, Mohammed A Assiri, Mudassir Hussain Tahir, Zeinhom M El-Bahy\",\"doi\":\"10.1016/j.saa.2024.125453\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>In this study, UV/visible absorption maxima of organic compounds are predicted with the help of machine learning (ML). Four ML models are evaluated, the gradient boosting model has performed best. We also analyzed feature importance. Using Python-based tools, we generated and visualized a new set of 5,000 organic compounds. These compounds were screened based on their predicted UV/visible absorption maxima, selecting those with red-shifted absorption. The assessment of synthetic accessibility indicated that most of the chosen compounds are relatively easy to synthesize.</p>\",\"PeriodicalId\":94213,\"journal\":{\"name\":\"Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy\",\"volume\":\"328 \",\"pages\":\"125453\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-11-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1016/j.saa.2024.125453\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.saa.2024.125453","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
在这项研究中,利用机器学习(ML)预测了有机化合物的紫外/可见吸收最大值。我们评估了四种 ML 模型,其中梯度提升模型表现最佳。我们还分析了特征的重要性。利用基于 Python 的工具,我们生成并可视化了一组新的 5,000 种有机化合物。我们根据预测的紫外/可见吸收最大值对这些化合物进行了筛选,选出了具有红移吸收的化合物。对合成可得性的评估表明,大多数被选中的化合物都比较容易合成。
UV/visible absorption maxima prediction of water-soluble organic compounds and generation of library of new organic compounds.
In this study, UV/visible absorption maxima of organic compounds are predicted with the help of machine learning (ML). Four ML models are evaluated, the gradient boosting model has performed best. We also analyzed feature importance. Using Python-based tools, we generated and visualized a new set of 5,000 organic compounds. These compounds were screened based on their predicted UV/visible absorption maxima, selecting those with red-shifted absorption. The assessment of synthetic accessibility indicated that most of the chosen compounds are relatively easy to synthesize.