Bo Xiao, Nafees Ahmad, Asif Mahmood, Mohamed H. Helal
{"title":"Data Mining and Machine Learning Analysis to Find Polymers for Electronic and Photovoltaics Applications: A Goal to Achieve Higher Dielectric Constant","authors":"Bo Xiao, Nafees Ahmad, Asif Mahmood, Mohamed H. Helal","doi":"10.1002/adts.202500166","DOIUrl":null,"url":null,"abstract":"The discovery of polymers with high dielectric constants is of significant interest for advanced electronic applications, such as capacitors, flexible electronics, and energy storage devices. In this study, data mining and machine learning (ML) techniques are applied to identify polymers with superior dielectric constant. Molecular descriptors are calculated. These descriptors are used to train several machine learning models, including linear regression, gradient booting regression, histgradient boosting regression, bagging regression, decision tree regression, and random forest regression. By employing cross-validation and hyperparameter tuning, best model is optimized for robust predictive performance. A database of 10k polymers is generated and their dielectric constant is predicted best ML model. Thirty polymers with higher dielectric constant values are selected. This work demonstrates the power of data-driven approaches in accelerating the discovery of high-performance polymers for electronic applications.","PeriodicalId":7219,"journal":{"name":"Advanced Theory and Simulations","volume":"92 1","pages":""},"PeriodicalIF":2.9000,"publicationDate":"2025-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Theory and Simulations","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1002/adts.202500166","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
The discovery of polymers with high dielectric constants is of significant interest for advanced electronic applications, such as capacitors, flexible electronics, and energy storage devices. In this study, data mining and machine learning (ML) techniques are applied to identify polymers with superior dielectric constant. Molecular descriptors are calculated. These descriptors are used to train several machine learning models, including linear regression, gradient booting regression, histgradient boosting regression, bagging regression, decision tree regression, and random forest regression. By employing cross-validation and hyperparameter tuning, best model is optimized for robust predictive performance. A database of 10k polymers is generated and their dielectric constant is predicted best ML model. Thirty polymers with higher dielectric constant values are selected. This work demonstrates the power of data-driven approaches in accelerating the discovery of high-performance polymers for electronic applications.
期刊介绍:
Advanced Theory and Simulations is an interdisciplinary, international, English-language journal that publishes high-quality scientific results focusing on the development and application of theoretical methods, modeling and simulation approaches in all natural science and medicine areas, including:
materials, chemistry, condensed matter physics
engineering, energy
life science, biology, medicine
atmospheric/environmental science, climate science
planetary science, astronomy, cosmology
method development, numerical methods, statistics