Machine Learning Science and Technology最新文献_第10页

Feature selection for high-dimensional neural network potentials with the adaptive group lasso 利用自适应群套索为高维神经网络电位进行特征选择

IF 6.8 2区物理与天体物理

Machine Learning Science and Technology Pub Date : 2024-05-16 DOI: 10.1088/2632-2153/ad450e

Johannes Sandberg, Thomas Voigtmann, Emilie Devijver and Noel Jakse

{"title":"Feature selection for high-dimensional neural network potentials with the adaptive group lasso","authors":"Johannes Sandberg, Thomas Voigtmann, Emilie Devijver and Noel Jakse","doi":"10.1088/2632-2153/ad450e","DOIUrl":"https://doi.org/10.1088/2632-2153/ad450e","url":null,"abstract":"Neural network potentials are a powerful tool for atomistic simulations, allowing to accurately reproduce ab initio potential energy surfaces with computational performance approaching classical force fields. A central component of such potentials is the transformation of atomic positions into a set of atomic features in a most efficient and informative way. In this work, a feature selection method is introduced for high dimensional neural network potentials, based on the adaptive group lasso (AGL) approach. It is shown that the use of an embedded method, taking into account the interplay between features and their action in the estimator, is necessary to optimize the number of features. The method’s efficiency is tested on three different monoatomic systems, including Lennard–Jones as a simple test case, Aluminium as a system characterized by predominantly radial interactions, and Boron as representative of a system with strongly directional components in the interactions. The AGL is compared with unsupervised filter methods and found to perform consistently better in reducing the number of features needed to reproduce the reference simulation data at a similar level of accuracy as the starting feature set. In particular, our results show the importance of taking into account model predictions in feature selection for interatomic potentials.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"51 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141060611","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A multifidelity approach to continual learning for physical systems 物理系统持续学习的多保真方法

IF 6.8 2区物理与天体物理

Machine Learning Science and Technology Pub Date : 2024-05-15 DOI: 10.1088/2632-2153/ad45b2

Amanda Howard, Yucheng Fu and Panos Stinis

引用次数: 0

A comprehensive machine learning-based investigation for the index-value prediction of 2G HTS coated conductor tapes 基于机器学习的 2G HTS 涂层导体带指数值预测综合研究

IF 6.8 2区物理与天体物理

Machine Learning Science and Technology Pub Date : 2024-05-12 DOI: 10.1088/2632-2153/ad45b1

Shahin Alipour Bonab, Giacomo Russo, Antonio Morandi and Mohammad Yazdani-Asrami

{"title":"A comprehensive machine learning-based investigation for the index-value prediction of 2G HTS coated conductor tapes","authors":"Shahin Alipour Bonab, Giacomo Russo, Antonio Morandi and Mohammad Yazdani-Asrami","doi":"10.1088/2632-2153/ad45b1","DOIUrl":"https://doi.org/10.1088/2632-2153/ad45b1","url":null,"abstract":"Index-value, or so-called n-value prediction is of paramount importance for understanding the superconductors’ behaviour specially when modeling of superconductors is needed. This parameter is dependent on several physical quantities including temperature, the magnetic field’s density and orientation, and affects the behaviour of high-temperature superconducting devices made out of coated conductors in terms of losses and quench propagation. In this paper, a comprehensive analysis of many machine learning (ML) methods for estimating the n-value has been carried out. The results demonstrated that cascade forward neural network (CFNN) excels in this scope. Despite needing considerably higher training time when compared to the other attempted models, it performs at the highest accuracy, with 0.48 root mean squared error (RMSE) and 99.72% Pearson coefficient for goodness of fit (R-squared). In contrast, the rigid regression method had the worst predictions with 4.92 RMSE and 37.29% R-squared. Also, random forest, boosting methods, and simple feed forward neural network can be considered as a middle accuracy model with faster training time than CFNN. The findings of this study not only advance modeling of superconductors but also pave the way for applications and further research on ML plug-and-play codes for superconducting studies including modeling of superconducting devices.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"5 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140931911","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Learning a general model of single phase flow in complex 3D porous media 学习复杂三维多孔介质中单相流的一般模型

IF 6.8 2区物理与天体物理

Machine Learning Science and Technology Pub Date : 2024-05-10 DOI: 10.1088/2632-2153/ad45af

Javier E Santos, Agnese Marcato, Qinjun Kang, Mohamed Mehana, Daniel O’Malley, Hari Viswanathan, Nicholas Lubbers

{"title":"Learning a general model of single phase flow in complex 3D porous media","authors":"Javier E Santos, Agnese Marcato, Qinjun Kang, Mohamed Mehana, Daniel O’Malley, Hari Viswanathan, Nicholas Lubbers","doi":"10.1088/2632-2153/ad45af","DOIUrl":"https://doi.org/10.1088/2632-2153/ad45af","url":null,"abstract":"Modeling effective transport properties of 3D porous media, such as permeability, at multiple scales is challenging as a result of the combined complexity of the pore structures and fluid physics—in particular, confinement effects which vary across the nanoscale to the microscale. While numerical simulation is possible, the computational cost is prohibitive for realistic domains, which are large and complex. Although machine learning (ML) models have been proposed to circumvent simulation, none so far has simultaneously accounted for heterogeneous 3D structures, fluid confinement effects, and multiple simulation resolutions. By utilizing numerous computer science techniques to improve the scalability of training, we have for the first time developed a general flow model that accounts for the pore-structure and corresponding physical phenomena at scales from Angstrom to the micrometer. Using synthetic computational domains for training, our ML model exhibits strong performance (<italic toggle=\"yes\">R</italic>\u0000<sup>2</sup> = 0.9) when tested on extremely diverse real domains at multiple scales.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"187 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140931932","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Closed-loop Koopman operator approximation 闭环库普曼算子近似值

IF 6.8 2区物理与天体物理

Machine Learning Science and Technology Pub Date : 2024-05-10 DOI: 10.1088/2632-2153/ad45b0

Steven Dahdah, James Richard Forbes

{"title":"Closed-loop Koopman operator approximation","authors":"Steven Dahdah, James Richard Forbes","doi":"10.1088/2632-2153/ad45b0","DOIUrl":"https://doi.org/10.1088/2632-2153/ad45b0","url":null,"abstract":"This paper proposes a method to identify a Koopman model of a feedback-controlled system given a known controller. The Koopman operator allows a nonlinear system to be rewritten as an infinite-dimensional linear system by viewing it in terms of an infinite set of lifting functions. A finite-dimensional approximation of the Koopman operator can be identified from data by choosing a finite subset of lifting functions and solving a regression problem in the lifted space. Existing methods are designed to identify open-loop systems. However, it is impractical or impossible to run experiments on some systems, such as unstable systems, in an open-loop fashion. The proposed method leverages the linearity of the Koopman operator, along with knowledge of the controller and the structure of the closed-loop (CL) system, to simultaneously identify the CL and plant systems. The advantages of the proposed CL Koopman operator approximation method are demonstrated in simulation using a Duffing oscillator and experimentally using a rotary inverted pendulum system. An open-source software implementation of the proposed method is publicly available, along with the experimental dataset generated for this paper.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"44 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140932006","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Accuracy vs memory advantage in the quantum simulation of stochastic processes 随机过程量子模拟中的精度与内存优势

IF 6.8 2区物理与天体物理

Machine Learning Science and Technology Pub Date : 2024-05-09 DOI: 10.1088/2632-2153/ad444a

Leonardo Banchi

引用次数: 0

Machine learned environment-dependent corrections for a spds∗ empirical tight-binding basis 机器学习环境对 spds∗ 经验紧密结合基础的修正

IF 6.8 2区物理与天体物理

Machine Learning Science and Technology Pub Date : 2024-05-09 DOI: 10.1088/2632-2153/ad4510

Daniele Soccodato, Gabriele Penazzi, Alessandro Pecchia, Anh-Luan Phan, Matthias Auf der Maur

{"title":"Machine learned environment-dependent corrections for a spds∗ empirical tight-binding basis","authors":"Daniele Soccodato, Gabriele Penazzi, Alessandro Pecchia, Anh-Luan Phan, Matthias Auf der Maur","doi":"10.1088/2632-2153/ad4510","DOIUrl":"https://doi.org/10.1088/2632-2153/ad4510","url":null,"abstract":"Empirical tight-binding (ETB) methods have become a common choice to simulate electronic and transport properties for systems composed of thousands of atoms. However, their performance is profoundly dependent on the way the empirical parameters were fitted, and the found parametrizations often exhibit poor transferability. In order to mitigate some of the the criticalities of this method, we introduce a novel Δ-learning scheme, called MLΔTB. After being trained on a custom data set composed of <italic toggle=\"yes\">ab-initio</italic> band structures, the framework is able to correlate the local atomistic environment to a correction on the on-site ETB parameters, for each atom in the system. The converged algorithm is applied to simulate the electronic properties of random GaAsSb alloys, and displays remarkable agreement both with experimental and <italic toggle=\"yes\">ab-initio</italic> test data. Some noteworthy characteristics of MLΔTB include the ability to be trained on few instances, to be applied on 3D supercells of arbitrary size, to be rotationally invariant, and to predict physical properties that are not exhibited by the training set.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"62 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140931909","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Physics-informed neural networks for an optimal counterdiabatic quantum computation 用于最佳逆绝热量子计算的物理信息神经网络

IF 6.8 2区物理与天体物理

Machine Learning Science and Technology Pub Date : 2024-05-09 DOI: 10.1088/2632-2153/ad450f

Antonio Ferrer-Sánchez, Carlos Flores-Garrigos, Carlos Hernani-Morales, José J Orquín-Marqués, Narendra N Hegade, Alejandro Gomez Cadavid, Iraitz Montalban, Enrique Solano, Yolanda Vives-Gilabert, José D Martín-Guerrero

{"title":"Physics-informed neural networks for an optimal counterdiabatic quantum computation","authors":"Antonio Ferrer-Sánchez, Carlos Flores-Garrigos, Carlos Hernani-Morales, José J Orquín-Marqués, Narendra N Hegade, Alejandro Gomez Cadavid, Iraitz Montalban, Enrique Solano, Yolanda Vives-Gilabert, José D Martín-Guerrero","doi":"10.1088/2632-2153/ad450f","DOIUrl":"https://doi.org/10.1088/2632-2153/ad450f","url":null,"abstract":"A novel methodology that leverages physics-informed neural networks to optimize quantum circuits in systems with <inline-formula>\u0000<tex-math><?CDATA $mathrm{N}_{mathrm{Q}}$?></tex-math>\u0000<mml:math overflow=\"scroll\"><mml:mrow><mml:msub><mml:mrow><mml:mi mathvariant=\"normal\">N</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mi mathvariant=\"normal\">Q</mml:mi></mml:mrow></mml:mrow></mml:msub></mml:mrow></mml:math>\u0000<inline-graphic xlink:href=\"mlstad450fieqn1.gif\" xlink:type=\"simple\"></inline-graphic>\u0000</inline-formula> qubits by addressing the counterdiabatic (CD) protocol is introduced. The primary purpose is to employ physics-inspired deep learning techniques for accurately modeling the time evolution of various physical observables within quantum systems. To achieve this, we integrate essential physical information into an underlying neural network to effectively tackle the problem. Specifically, the imposition of the solution to meet the principle of least action, along with the hermiticity condition on all physical observables, among others, ensuring the acquisition of appropriate CD terms based on underlying physics. This approach provides a reliable alternative to previous methodologies relying on classical numerical approximations, eliminating their inherent constraints. The proposed method offers a versatile framework for optimizing physical observables relevant to the problem, such as the scheduling function, gauge potential, temporal evolution of energy levels, among others. This methodology has been successfully applied to 2-qubit representing <inline-formula>\u0000<tex-math><?CDATA $mathrm{H}_{2}$?></tex-math>\u0000<mml:math overflow=\"scroll\"><mml:mrow><mml:msub><mml:mrow><mml:mi mathvariant=\"normal\">H</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:math>\u0000<inline-graphic xlink:href=\"mlstad450fieqn2.gif\" xlink:type=\"simple\"></inline-graphic>\u0000</inline-formula> molecule using the STO-3G basis, demonstrating the derivation of a desirable decomposition for non-adiabatic terms through a linear combination of Pauli operators. This attribute confers significant advantages for practical implementation within quantum computing algorithms.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"185 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140931908","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Towards a machine-learned Poisson solver for low-temperature plasma simulations in complex geometries 开发用于复杂几何形状低温等离子体模拟的机器学习泊松求解器

IF 6.8 2区物理与天体物理

Machine Learning Science and Technology Pub Date : 2024-05-06 DOI: 10.1088/2632-2153/ad4230

Ihda Chaerony Siffa, Markus M Becker, Klaus-Dieter Weltmann and Jan Trieschmann

{"title":"Towards a machine-learned Poisson solver for low-temperature plasma simulations in complex geometries","authors":"Ihda Chaerony Siffa, Markus M Becker, Klaus-Dieter Weltmann and Jan Trieschmann","doi":"10.1088/2632-2153/ad4230","DOIUrl":"https://doi.org/10.1088/2632-2153/ad4230","url":null,"abstract":"Poisson’s equation plays an important role in modeling many physical systems. In electrostatic self-consistent low-temperature plasma (LTP) simulations, Poisson’s equation is solved at each simulation time step, which can amount to a significant computational cost for the entire simulation. In this paper, we describe the development of a generic machine-learned Poisson solver specifically designed for the requirements of LTP simulations in complex 2D reactor geometries on structured Cartesian grids. Here, the reactor geometries can consist of inner electrodes and dielectric materials as often found in LTP simulations. The approach leverages a hybrid CNN-transformer network architecture in combination with a weighted multiterm loss function. We train the network using highly randomized synthetic data to ensure the generalizability of the learned solver to unseen reactor geometries. The results demonstrate that the learned solver is able to produce quantitatively and qualitatively accurate solutions. Furthermore, it generalizes well on new reactor geometries such as reference geometries found in the literature. To increase the numerical accuracy of the solutions required in LTP simulations, we employ a conventional iterative solver to refine the raw predictions, especially to recover the high-frequency features not resolved by the initial prediction. With this, the proposed learned Poisson solver provides the required accuracy and is potentially faster than a pure GPU-based conventional iterative solver. This opens up new possibilities for developing a generic and high-performing learned Poisson solver for LTP systems in complex geometries.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"23 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140888473","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Distilling particle knowledge for fast reconstruction at high-energy physics experiments 提炼粒子知识，促进高能物理实验的快速重建

IF 6.8 2区物理与天体物理

Machine Learning Science and Technology Pub Date : 2024-05-06 DOI: 10.1088/2632-2153/ad43b1

A Bal, T Brandes, F Iemmi, M Klute, B Maier, V Mikuni and T K Årrestad

{"title":"Distilling particle knowledge for fast reconstruction at high-energy physics experiments","authors":"A Bal, T Brandes, F Iemmi, M Klute, B Maier, V Mikuni and T K Årrestad","doi":"10.1088/2632-2153/ad43b1","DOIUrl":"https://doi.org/10.1088/2632-2153/ad43b1","url":null,"abstract":"Knowledge distillation is a form of model compression that allows artificial neural networks of different sizes to learn from one another. Its main application is the compactification of large deep neural networks to free up computational resources, in particular on edge devices. In this article, we consider proton-proton collisions at the High-Luminosity Large Hadron Collider (HL-LHC) and demonstrate a successful knowledge transfer from an event-level graph neural network (GNN) to a particle-level small deep neural network (DNN). Our algorithm, DistillNet, is a DNN that is trained to learn about the provenance of particles, as provided by the soft labels that are the GNN outputs, to predict whether or not a particle originates from the primary interaction vertex. The results indicate that for this problem, which is one of the main challenges at the HL-LHC, there is minimal loss during the transfer of knowledge to the small student network, while improving significantly the computational resource needs compared to the teacher. This is demonstrated for the distilled student network on a CPU, as well as for a quantized and pruned student network deployed on an field programmable gate array. Our study proves that knowledge transfer between networks of different complexity can be used for fast artificial intelligence (AI) in high-energy physics that improves the expressiveness of observables over non-AI-based reconstruction algorithms. Such an approach can become essential at the HL-LHC experiments, e.g. to comply with the resource budget of their trigger stages.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"15 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140888488","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0