Junhao Liang, Caiyuan Ye, Xinyi Lin, Chunlin Yu, Yilimiranmu Rouzhahong, Chao Liang* and Huashan Li*,
{"title":"结合表面特征和晶体相似度的机器学习探索二维材料。","authors":"Junhao Liang, Caiyuan Ye, Xinyi Lin, Chunlin Yu, Yilimiranmu Rouzhahong, Chao Liang* and Huashan Li*, ","doi":"10.1021/acs.jpclett.5c01652","DOIUrl":null,"url":null,"abstract":"<p >The property prediction of two-dimensional (2D) materials has been constrained to specific systems or the highly self-similar Computational 2D Materials Database (C2DB) due to limited data availability and inadequate feature extraction. These challenges impede extrapolated predictions and large-scale exploitation across diverse databases. We developed a model named crystal surface and cluster network (CCSN) that integrates crystal surface and cluster features while leveraging transfer learning from bulk material databases to enhance model performance. Compared to the widely adopted CGCNN model, our approach achieved a 30% reduction in mean absolute error for bandgap prediction on the Materials Cloud 2D crystals database, characterized by low self-similarity and limited data volume. To enable extrapolated predictions, we developed a prediction workflow based on crystal similarity, which selects the most similar database for model training and determines the necessity of applying transfer learning. This method was applied to predict bandgaps of 8,218 crystals without bandgap values in the C2DB database and subsequently validated through our DFT calculations. The proposed prediction workflow, based on the CCSN model, would enable the discovery of 2D materials through rapid property prediction and can be generalized to improve any prediction models dealing with scarce or biased data sets.</p>","PeriodicalId":62,"journal":{"name":"The Journal of Physical Chemistry Letters","volume":"16 34","pages":"8723–8730"},"PeriodicalIF":4.6000,"publicationDate":"2025-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine Learning Integrating Surface Features and Crystal Similarity for Exploring 2D Materials\",\"authors\":\"Junhao Liang, Caiyuan Ye, Xinyi Lin, Chunlin Yu, Yilimiranmu Rouzhahong, Chao Liang* and Huashan Li*, \",\"doi\":\"10.1021/acs.jpclett.5c01652\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p >The property prediction of two-dimensional (2D) materials has been constrained to specific systems or the highly self-similar Computational 2D Materials Database (C2DB) due to limited data availability and inadequate feature extraction. These challenges impede extrapolated predictions and large-scale exploitation across diverse databases. We developed a model named crystal surface and cluster network (CCSN) that integrates crystal surface and cluster features while leveraging transfer learning from bulk material databases to enhance model performance. Compared to the widely adopted CGCNN model, our approach achieved a 30% reduction in mean absolute error for bandgap prediction on the Materials Cloud 2D crystals database, characterized by low self-similarity and limited data volume. To enable extrapolated predictions, we developed a prediction workflow based on crystal similarity, which selects the most similar database for model training and determines the necessity of applying transfer learning. This method was applied to predict bandgaps of 8,218 crystals without bandgap values in the C2DB database and subsequently validated through our DFT calculations. The proposed prediction workflow, based on the CCSN model, would enable the discovery of 2D materials through rapid property prediction and can be generalized to improve any prediction models dealing with scarce or biased data sets.</p>\",\"PeriodicalId\":62,\"journal\":{\"name\":\"The Journal of Physical Chemistry Letters\",\"volume\":\"16 34\",\"pages\":\"8723–8730\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2025-08-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The Journal of Physical Chemistry Letters\",\"FirstCategoryId\":\"1\",\"ListUrlMain\":\"https://pubs.acs.org/doi/10.1021/acs.jpclett.5c01652\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CHEMISTRY, PHYSICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Journal of Physical Chemistry Letters","FirstCategoryId":"1","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acs.jpclett.5c01652","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
Machine Learning Integrating Surface Features and Crystal Similarity for Exploring 2D Materials
The property prediction of two-dimensional (2D) materials has been constrained to specific systems or the highly self-similar Computational 2D Materials Database (C2DB) due to limited data availability and inadequate feature extraction. These challenges impede extrapolated predictions and large-scale exploitation across diverse databases. We developed a model named crystal surface and cluster network (CCSN) that integrates crystal surface and cluster features while leveraging transfer learning from bulk material databases to enhance model performance. Compared to the widely adopted CGCNN model, our approach achieved a 30% reduction in mean absolute error for bandgap prediction on the Materials Cloud 2D crystals database, characterized by low self-similarity and limited data volume. To enable extrapolated predictions, we developed a prediction workflow based on crystal similarity, which selects the most similar database for model training and determines the necessity of applying transfer learning. This method was applied to predict bandgaps of 8,218 crystals without bandgap values in the C2DB database and subsequently validated through our DFT calculations. The proposed prediction workflow, based on the CCSN model, would enable the discovery of 2D materials through rapid property prediction and can be generalized to improve any prediction models dealing with scarce or biased data sets.
期刊介绍:
The Journal of Physical Chemistry (JPC) Letters is devoted to reporting new and original experimental and theoretical basic research of interest to physical chemists, biophysical chemists, chemical physicists, physicists, material scientists, and engineers. An important criterion for acceptance is that the paper reports a significant scientific advance and/or physical insight such that rapid publication is essential. Two issues of JPC Letters are published each month.