Esteban Bertsch-Aguilar, Antonio Piedra, Daniel Acuña, Sebastián Suñer, Sylvana Pinheiro, William J Zamora
{"title":"LiProS: Findable, Accessible, Interoperable, and Reusable Data Simulation Workflow to Predict Accurate Lipophilicity Profiles for Small Molecules.","authors":"Esteban Bertsch-Aguilar, Antonio Piedra, Daniel Acuña, Sebastián Suñer, Sylvana Pinheiro, William J Zamora","doi":"10.1002/minf.70007","DOIUrl":null,"url":null,"abstract":"<p><p>Lipophilicity is a fundamental physicochemical property widely used to evaluate key parameters in drug design, materials science, and food engineering. It plays a critical role in predicting membrane permeability, absorption, and distribution of compounds. Moreover, lipophilicity is commonly integrated into scoring functions to model biomolecular interactions and serves as an important molecular descriptor in machine learning models for property prediction and compound classification. The election of the appropriate pH-dependent lipophilicity ( <math> <semantics><mrow><mi>log</mi> <msub><mi>D</mi> <mrow><mtext>pH</mtext></mrow> </msub> </mrow> <annotation>$$ \\mathrm{log} {D}_{pH} $$</annotation></semantics> </math> ) model is important to ensure its accuracy. The incorporation of the ion apparent partition coefficient ( <math> <semantics> <mrow><msubsup><mi>P</mi> <mi>I</mi> <mtext>app</mtext></msubsup> </mrow> <annotation>$$ {P}_{\\text{I}}^{\\text{app}}$$</annotation></semantics> </math> ) into predictions of pH-dependent lipophilicity profiles can be essential for accurately reproducing experimental results. In accordance with the principles for findable, accessible, interoperable, and reusable data to improve data management and sharing, here, we introduce LiProS, a FAIR workflow that is easily accessible through a Google Colab notebook. LiProS assists researchers in efficiently determining the appropriate pH-dependent lipophilicity profile based on the SMILES code of their molecules of interest. In addition, LiProS demonstrated its utility in the analysis of ionizable compounds within the NAPRORE-CR natural products database, enabling the identification of the most appropriate lipophilicity formalism tailored to the physicochemical characteristics of these compounds.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":"44 8","pages":"e202500136"},"PeriodicalIF":3.1000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular Informatics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1002/minf.70007","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CHEMISTRY, MEDICINAL","Score":null,"Total":0}
引用次数: 0
Abstract
Lipophilicity is a fundamental physicochemical property widely used to evaluate key parameters in drug design, materials science, and food engineering. It plays a critical role in predicting membrane permeability, absorption, and distribution of compounds. Moreover, lipophilicity is commonly integrated into scoring functions to model biomolecular interactions and serves as an important molecular descriptor in machine learning models for property prediction and compound classification. The election of the appropriate pH-dependent lipophilicity ( ) model is important to ensure its accuracy. The incorporation of the ion apparent partition coefficient ( ) into predictions of pH-dependent lipophilicity profiles can be essential for accurately reproducing experimental results. In accordance with the principles for findable, accessible, interoperable, and reusable data to improve data management and sharing, here, we introduce LiProS, a FAIR workflow that is easily accessible through a Google Colab notebook. LiProS assists researchers in efficiently determining the appropriate pH-dependent lipophilicity profile based on the SMILES code of their molecules of interest. In addition, LiProS demonstrated its utility in the analysis of ionizable compounds within the NAPRORE-CR natural products database, enabling the identification of the most appropriate lipophilicity formalism tailored to the physicochemical characteristics of these compounds.
期刊介绍:
Molecular Informatics is a peer-reviewed, international forum for publication of high-quality, interdisciplinary research on all molecular aspects of bio/cheminformatics and computer-assisted molecular design. Molecular Informatics succeeded QSAR & Combinatorial Science in 2010.
Molecular Informatics presents methodological innovations that will lead to a deeper understanding of ligand-receptor interactions, macromolecular complexes, molecular networks, design concepts and processes that demonstrate how ideas and design concepts lead to molecules with a desired structure or function, preferably including experimental validation.
The journal''s scope includes but is not limited to the fields of drug discovery and chemical biology, protein and nucleic acid engineering and design, the design of nanomolecular structures, strategies for modeling of macromolecular assemblies, molecular networks and systems, pharmaco- and chemogenomics, computer-assisted screening strategies, as well as novel technologies for the de novo design of biologically active molecules. As a unique feature Molecular Informatics publishes so-called "Methods Corner" review-type articles which feature important technological concepts and advances within the scope of the journal.