{"title":"Mechanical property prediction of random copolymers using uncertainty-based active learning","authors":"Wei-Che Chang , Zong-Yun Tsai , Chin-Wen Chen , Chi-Hua Yu , Chuin-Shan Chen","doi":"10.1016/j.commatsci.2024.113489","DOIUrl":null,"url":null,"abstract":"<div><div>The copolymer, a widely used material in our daily lives, presents a significant challenge in targeted sequence design. While recent advancements in computational simulation and data science offer a promising avenue for addressing this complex issue, challenges persist in labeled data scarcity. In this study, we introduce an uncertainty-based active learning framework for predicting the properties of random copolymers. We found that the active learning strategy allowed for labeling only 40 data points within the design space of 1550 data points, drastically reducing the labeling efforts by 97%. Most data selected by active learning were positioned on the design space’s periphery, transforming the learning task into an interpolation problem. Through integrating active learning and molecular dynamics, we successfully overcame the combinatorial explosion problem in copolymer sequence design, streamlining the data labeling process and culminating in a highly accurate model. This research demonstrates data science’s potential in polymer design, especially when facing data scarcity.</div></div>","PeriodicalId":10650,"journal":{"name":"Computational Materials Science","volume":"247 ","pages":"Article 113489"},"PeriodicalIF":3.1000,"publicationDate":"2024-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Materials Science","FirstCategoryId":"88","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0927025624007109","RegionNum":3,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
The copolymer, a widely used material in our daily lives, presents a significant challenge in targeted sequence design. While recent advancements in computational simulation and data science offer a promising avenue for addressing this complex issue, challenges persist in labeled data scarcity. In this study, we introduce an uncertainty-based active learning framework for predicting the properties of random copolymers. We found that the active learning strategy allowed for labeling only 40 data points within the design space of 1550 data points, drastically reducing the labeling efforts by 97%. Most data selected by active learning were positioned on the design space’s periphery, transforming the learning task into an interpolation problem. Through integrating active learning and molecular dynamics, we successfully overcame the combinatorial explosion problem in copolymer sequence design, streamlining the data labeling process and culminating in a highly accurate model. This research demonstrates data science’s potential in polymer design, especially when facing data scarcity.
期刊介绍:
The goal of Computational Materials Science is to report on results that provide new or unique insights into, or significantly expand our understanding of, the properties of materials or phenomena associated with their design, synthesis, processing, characterization, and utilization. To be relevant to the journal, the results should be applied or applicable to specific material systems that are discussed within the submission.