{"title":"理解化学数据的形态——具有持久同源性的应用。","authors":"Joshua Bilsky, Aurora E Clark","doi":"10.1063/5.0281156","DOIUrl":null,"url":null,"abstract":"<p><p>Chemical data often have complex and nonlinear patterns in how data points relate to one another. Concurrently, there are many situations where chemical data are of high dimensionality (e.g., the 3N-dimensional potential energy landscape). Both complexity and high dimensionality pose challenges for analyses that seek to uncover fundamental structure-property relationships or to develop foundational models of chemical behavior. This Perspective offers mathematical context, illustrative applications, and conceptual motivation for using persistent homology (PH) to identify and provide new physical insight into the multiple spatiotemporal-scale patterns present in chemical data. We address the implications of different data representations and highlight the relationships of PH-derived descriptors to physicochemical properties and chemical behavior. Applications in machine learning are also discussed, emphasizing how PH can enhance predictive modeling. Finally, we review commonly used PH software, offering recommendations on usability, flexibility, and data requirements.</p>","PeriodicalId":15313,"journal":{"name":"Journal of Chemical Physics","volume":"163 9","pages":""},"PeriodicalIF":3.1000,"publicationDate":"2025-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Understanding the shape of chemistry data-Applications with persistent homology.\",\"authors\":\"Joshua Bilsky, Aurora E Clark\",\"doi\":\"10.1063/5.0281156\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Chemical data often have complex and nonlinear patterns in how data points relate to one another. Concurrently, there are many situations where chemical data are of high dimensionality (e.g., the 3N-dimensional potential energy landscape). Both complexity and high dimensionality pose challenges for analyses that seek to uncover fundamental structure-property relationships or to develop foundational models of chemical behavior. This Perspective offers mathematical context, illustrative applications, and conceptual motivation for using persistent homology (PH) to identify and provide new physical insight into the multiple spatiotemporal-scale patterns present in chemical data. We address the implications of different data representations and highlight the relationships of PH-derived descriptors to physicochemical properties and chemical behavior. Applications in machine learning are also discussed, emphasizing how PH can enhance predictive modeling. Finally, we review commonly used PH software, offering recommendations on usability, flexibility, and data requirements.</p>\",\"PeriodicalId\":15313,\"journal\":{\"name\":\"Journal of Chemical Physics\",\"volume\":\"163 9\",\"pages\":\"\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2025-09-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Chemical Physics\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1063/5.0281156\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"CHEMISTRY, PHYSICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemical Physics","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1063/5.0281156","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
Understanding the shape of chemistry data-Applications with persistent homology.
Chemical data often have complex and nonlinear patterns in how data points relate to one another. Concurrently, there are many situations where chemical data are of high dimensionality (e.g., the 3N-dimensional potential energy landscape). Both complexity and high dimensionality pose challenges for analyses that seek to uncover fundamental structure-property relationships or to develop foundational models of chemical behavior. This Perspective offers mathematical context, illustrative applications, and conceptual motivation for using persistent homology (PH) to identify and provide new physical insight into the multiple spatiotemporal-scale patterns present in chemical data. We address the implications of different data representations and highlight the relationships of PH-derived descriptors to physicochemical properties and chemical behavior. Applications in machine learning are also discussed, emphasizing how PH can enhance predictive modeling. Finally, we review commonly used PH software, offering recommendations on usability, flexibility, and data requirements.
期刊介绍:
The Journal of Chemical Physics publishes quantitative and rigorous science of long-lasting value in methods and applications of chemical physics. The Journal also publishes brief Communications of significant new findings, Perspectives on the latest advances in the field, and Special Topic issues. The Journal focuses on innovative research in experimental and theoretical areas of chemical physics, including spectroscopy, dynamics, kinetics, statistical mechanics, and quantum mechanics. In addition, topical areas such as polymers, soft matter, materials, surfaces/interfaces, and systems of biological relevance are of increasing importance.
Topical coverage includes:
Theoretical Methods and Algorithms
Advanced Experimental Techniques
Atoms, Molecules, and Clusters
Liquids, Glasses, and Crystals
Surfaces, Interfaces, and Materials
Polymers and Soft Matter
Biological Molecules and Networks.