Machine Learning Science and Technology最新文献

Mamba time series forecasting with uncertainty quantification. 不确定量化的曼巴时间序列预测。

IF 6.3 2区物理与天体物理

Machine Learning Science and Technology Pub Date : 2025-09-30 Epub Date: 2025-07-22 DOI: 10.1088/2632-2153/adec3b

Pedro Pessoa, Paul Campitelli, Douglas P Shepherd, S Banu Ozkan, Steve Pressé

{"title":"Mamba time series forecasting with uncertainty quantification.","authors":"Pedro Pessoa, Paul Campitelli, Douglas P Shepherd, S Banu Ozkan, Steve Pressé","doi":"10.1088/2632-2153/adec3b","DOIUrl":"10.1088/2632-2153/adec3b","url":null,"abstract":"State space models, such as Mamba, have recently garnered attention in time series forecasting (TSF) due to their ability to capture sequence patterns. However, in electricity consumption benchmarks, Mamba forecasts exhibit a mean error of approximately 8%. Similarly, in traffic occupancy benchmarks, the mean error reaches 18%. This discrepancy leaves us to wonder whether the prediction is simply inaccurate or falls within error given spread in historical data. To address this limitation, we propose a method to quantify the predictive uncertainty of Mamba forecasts. To achieve this, we propose a dual-network framework based on the Mamba architecture for probabilistic forecasting, where one network generates point forecasts while the other estimates predictive uncertainty by modeling variance. We abbreviate our tool, Mamba with probabilistic TSF, as Mamba-ProbTSF and the code for its implementation is available on GitHub https://github.com/PessoaP/Mamba-ProbTSF. Evaluating this approach on synthetic and real-world benchmark datasets, we find Kullback-Leibler divergence between the learned distributions and the data-which, in the limit of infinite data, should converge to zero if the model correctly captures the underlying probability distribution-reduced to the order of 10-3 for synthetic data and 10-1 for real-world benchmark. We find that in both the electricity consumption and traffic occupancy benchmark, the true trajectory stays within the predicted uncertainty interval at the two-sigma level about 95% of the time. We further compare Mamba-ProbTSF against leading probabilistic forecast methods, DeepAR and ARIMA, and show that our method consistently achieves lower forecast errors while offering more reliable uncertainty quantification. We end with a consideration of potential limitations, adjustments to improve performance, and considerations for applying this framework to processes for purely or largely stochastic dynamics where the stochastic changes accumulate as observed, for example, in pure Brownian motion or molecular dynamics trajectories.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"6 3","pages":"035012"},"PeriodicalIF":6.3,"publicationDate":"2025-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12281171/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144699735","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Prior guided deep difference meta-learner for fast adaptation to stylized segmentation. 先验引导深度差异元学习器快速适应程式化分割。

IF 6.3 2区物理与天体物理

Machine Learning Science and Technology Pub Date : 2025-06-30 Epub Date: 2025-04-16 DOI: 10.1088/2632-2153/adc970

Dan Nguyen, Anjali Balagopal, Ti Bai, Michael Dohopolski, Mu-Han Lin, Steve Jiang

{"title":"Prior guided deep difference meta-learner for fast adaptation to stylized segmentation.","authors":"Dan Nguyen, Anjali Balagopal, Ti Bai, Michael Dohopolski, Mu-Han Lin, Steve Jiang","doi":"10.1088/2632-2153/adc970","DOIUrl":"https://doi.org/10.1088/2632-2153/adc970","url":null,"abstract":"Radiotherapy treatment planning requires segmenting anatomical structures in various styles, influenced by guidelines, protocols, preferences, or dose planning needs. Deep learning-based auto-segmentation models, trained on anatomical definitions, may not match local clinicians' styles at new institutions. Adapting these models can be challenging without sufficient resources. We hypothesize that consistent differences between segmentation styles and anatomical definitions can be learned from initial patients and applied to pre-trained models for more precise segmentation. We propose a Prior-guided deep difference meta-learner (DDL) to learn and adapt these differences. We collected data from 440 patients for model development and 30 for testing. The dataset includes contours of the prostate clinical target volume (CTV), parotid, and rectum. We developed a deep learning framework that segments new images with a matching style using example styles as a prior, without model retraining. The pre-trained segmentation models were adapted to three different clinician styles for post-operative CTV for prostate, parotid gland, and rectum segmentation. We tested the model's ability to learn unseen styles and compared its performance with transfer learning, using varying amounts of prior patient style data (0-10 patients). Performance was quantitatively evaluated using dice similarity coefficient (DSC) and Hausdorff distance. With exposure to only three patients for the model, the average DSC (%) improved from 78.6, 71.9, 63.0, 69.6, 52.2 and 46.3-84.4, 77.8, 73.0, 77.8, 70.5, 68.1, for CTVstyle1, CTVstyle2, CTVstyle3, Parotidsuperficial, Rectumsuperior, and Rectumposterior, respectively. The proposed Prior-guided DDL is a fast and effortless network for adapting a structure to new styles. The improved segmentation accuracy may result in reduced contour editing time, providing a more efficient and streamlined clinical workflow.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"6 2","pages":"025016"},"PeriodicalIF":6.3,"publicationDate":"2025-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12001319/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144002018","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Quality assurance for online adaptive radiotherapy: a secondary dose verification model with geometry-encoded U-Net. 在线自适应放射治疗的质量保证：采用几何编码 U-Net 的二次剂量验证模型。

IF 6.3 2区物理与天体物理

Machine Learning Science and Technology Pub Date : 2024-12-01 Epub Date: 2024-10-11 DOI: 10.1088/2632-2153/ad829e

Shunyu Yan, Austen Maniscalco, Biling Wang, Dan Nguyen, Steve Jiang, Chenyang Shen

{"title":"Quality assurance for online adaptive radiotherapy: a secondary dose verification model with geometry-encoded U-Net.","authors":"Shunyu Yan, Austen Maniscalco, Biling Wang, Dan Nguyen, Steve Jiang, Chenyang Shen","doi":"10.1088/2632-2153/ad829e","DOIUrl":"10.1088/2632-2153/ad829e","url":null,"abstract":"In online adaptive radiotherapy (ART), quick computation-based secondary dose verification is crucial for ensuring the quality of ART plans while the patient is positioned on the treatment couch. However, traditional dose verification algorithms are generally time-consuming, reducing the efficiency of ART workflow. This study aims to develop an ultra-fast deep-learning (DL) based secondary dose verification algorithm to accurately estimate dose distributions using computed tomography (CT) and fluence maps (FMs). We integrated FMs into the CT image domain by explicitly resolving the geometry of treatment delivery. For each gantry angle, an FM was constructed based on the optimized multi-leaf collimator apertures and corresponding monitoring units. To effectively encode treatment beam configuration, the constructed FMs were back-projected to <math><mrow><mn>30</mn></mrow> </math> cm away from the isocenter with respect to the exact geometry of the treatment machines. Then, a 3D U-Net was utilized to take the integrated CT and FM volume as input to estimate dose. Training and validation were performed on <math><mrow><mn>381</mn></mrow> </math> prostate cancer cases, with an additional <math><mrow><mn>40</mn></mrow> </math> testing cases for independent evaluation of model performance. The proposed model can estimate dose in ∼ <math><mrow><mn>15</mn></mrow> </math> ms for each patient. The average γ passing rate ( <math><mrow><mn>3</mn> <mi>%</mi> <mrow><mo>/</mo></mrow> <mn>2</mn> <mstyle></mstyle> <mrow><mtext>mm</mtext></mrow> </mrow> </math> , <math><mrow><mn>10</mn> <mi>%</mi></mrow> </math> threshold) for the estimated dose was 99.9% ± 0.15% on testing patients. The mean dose differences for the planning target volume and organs at risk were <math><mrow><mn>0.07</mn> <mi>%</mi> <mo>±</mo> <mn>0.34</mn> <mi>%</mi></mrow> </math> and <math><mrow><mn>0.48</mn> <mi>%</mi> <mo>±</mo> <mn>0.72</mn> <mi>%</mi></mrow> </math> , respectively. We have developed a geometry-resolved DL framework for accurate dose estimation and demonstrated its potential in real-time online ART doses verification.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"5 4","pages":"045013"},"PeriodicalIF":6.3,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11467776/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142476443","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Equivariant tensor network potentials 等变张量网络势

IF 6.8 2区物理与天体物理

Machine Learning Science and Technology Pub Date : 2024-09-18 DOI: 10.1088/2632-2153/ad79b5

M Hodapp and A Shapeev

{"title":"Equivariant tensor network potentials","authors":"M Hodapp and A Shapeev","doi":"10.1088/2632-2153/ad79b5","DOIUrl":"https://doi.org/10.1088/2632-2153/ad79b5","url":null,"abstract":"Machine-learning interatomic potentials (MLIPs) have made a significant contribution to the recent progress in the fields of computational materials and chemistry due to the MLIPs’ ability of accurately approximating energy landscapes of quantum-mechanical models while being orders of magnitude more computationally efficient. However, the computational cost and number of parameters of many state-of-the-art MLIPs increases exponentially with the number of atomic features. Tensor (non-neural) networks, based on low-rank representations of high-dimensional tensors, have been a way to reduce the number of parameters in approximating multidimensional functions, however, it is often not easy to encode the model symmetries into them. In this work we develop a formalism for rank-efficient equivariant tensor networks (ETNs), i.e. tensor networks that remain invariant under actions of SO(3) upon contraction. All the key algorithms of tensor networks like orthogonalization of cores and DMRG-based algorithms carry over to our equivariant case. Moreover, we show that many elements of modern neural network architectures like message passing, pulling, or attention mechanisms, can in some form be implemented into the ETNs. Based on ETNs, we develop a new class of polynomial-based MLIPs that demonstrate superior performance over existing MLIPs for multicomponent systems.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"4 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142268878","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Optimizing ZX-diagrams with deep reinforcement learning 利用深度强化学习优化 ZX 图

IF 6.8 2区物理与天体物理

Machine Learning Science and Technology Pub Date : 2024-09-18 DOI: 10.1088/2632-2153/ad76f7

Maximilian Nägele and Florian Marquardt

引用次数: 0

DiffLense: a conditional diffusion model for super-resolution of gravitational lensing data DiffLense：引力透镜数据超分辨率的条件扩散模型

IF 6.8 2区物理与天体物理

Machine Learning Science and Technology Pub Date : 2024-09-18 DOI: 10.1088/2632-2153/ad76f8

Pranath Reddy, Michael W Toomey, Hanna Parul and Sergei Gleyzer

{"title":"DiffLense: a conditional diffusion model for super-resolution of gravitational lensing data","authors":"Pranath Reddy, Michael W Toomey, Hanna Parul and Sergei Gleyzer","doi":"10.1088/2632-2153/ad76f8","DOIUrl":"https://doi.org/10.1088/2632-2153/ad76f8","url":null,"abstract":"Gravitational lensing data is frequently collected at low resolution due to instrumental limitations and observing conditions. Machine learning-based super-resolution techniques offer a method to enhance the resolution of these images, enabling more precise measurements of lensing effects and a better understanding of the matter distribution in the lensing system. This enhancement can significantly improve our knowledge of the distribution of mass within the lensing galaxy and its environment, as well as the properties of the background source being lensed. Traditional super-resolution techniques typically learn a mapping function from lower-resolution to higher-resolution samples. However, these methods are often constrained by their dependence on optimizing a fixed distance function, which can result in the loss of intricate details crucial for astrophysical analysis. In this work, we introduce DiffLense, a novel super-resolution pipeline based on a conditional diffusion model specifically designed to enhance the resolution of gravitational lensing images obtained from the Hyper Suprime-Cam Subaru Strategic Program (HSC-SSP). Our approach adopts a generative model, leveraging the detailed structural information present in Hubble space telescope (HST) counterparts. The diffusion model, trained to generate HST data, is conditioned on HSC data pre-processed with denoising techniques and thresholding to significantly reduce noise and background interference. This process leads to a more distinct and less overlapping conditional distribution during the model’s training phase. We demonstrate that DiffLense outperforms existing state-of-the-art single-image super-resolution techniques, particularly in retaining the fine details necessary for astrophysical analyses.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"70 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142255166","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Masked particle modeling on sets: towards self-supervised high energy physics foundation models 集合上的掩蔽粒子建模：走向自监督高能物理基础模型

IF 6.8 2区物理与天体物理

Machine Learning Science and Technology Pub Date : 2024-09-16 DOI: 10.1088/2632-2153/ad64a8

Tobias Golling, Lukas Heinrich, Michael Kagan, Samuel Klein, Matthew Leigh, Margarita Osadchy and John Andrew Raine

{"title":"Masked particle modeling on sets: towards self-supervised high energy physics foundation models","authors":"Tobias Golling, Lukas Heinrich, Michael Kagan, Samuel Klein, Matthew Leigh, Margarita Osadchy and John Andrew Raine","doi":"10.1088/2632-2153/ad64a8","DOIUrl":"https://doi.org/10.1088/2632-2153/ad64a8","url":null,"abstract":"We propose masked particle modeling (MPM) as a self-supervised method for learning generic, transferable, and reusable representations on unordered sets of inputs for use in high energy physics (HEP) scientific data. This work provides a novel scheme to perform masked modeling based pre-training to learn permutation invariant functions on sets. More generally, this work provides a step towards building large foundation models for HEP that can be generically pre-trained with self-supervised learning and later fine-tuned for a variety of down-stream tasks. In MPM, particles in a set are masked and the training objective is to recover their identity, as defined by a discretized token representation of a pre-trained vector quantized variational autoencoder. We study the efficacy of the method in samples of high energy jets at collider physics experiments, including studies on the impact of discretization, permutation invariance, and ordering. We also study the fine-tuning capability of the model, showing that it can be adapted to tasks such as supervised and weakly supervised jet classification, and that the model can transfer efficiently with small fine-tuning data sets to new classes and new data domains.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"75 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142255167","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Transforming the bootstrap: using transformers to compute scattering amplitudes in planar N =... 转换自举法：使用转换器计算平面 N =...

IF 6.8 2区物理与天体物理

Machine Learning Science and Technology Pub Date : 2024-09-15 DOI: 10.1088/2632-2153/ad743e

Tianji Cai, Garrett W Merz, François Charton, Niklas Nolte, Matthias Wilhelm, Kyle Cranmer and Lance J Dixon

引用次数: 0

Learning on the correctness class for domain inverse problems of gravimetry 关于重力测量领域反问题正确性类的学习

IF 6.8 2区物理与天体物理

Machine Learning Science and Technology Pub Date : 2024-09-11 DOI: 10.1088/2632-2153/ad72cc

Yihang Chen and Wenbin Li

引用次数: 0

A combined modeling method for complex multi-fidelity data fusion 复杂多保真数据融合的组合建模方法

IF 6.8 2区物理与天体物理

Machine Learning Science and Technology Pub Date : 2024-09-10 DOI: 10.1088/2632-2153/ad718f

Lei Tang, Feng Liu, Anping Wu, Yubo Li, Wanqiu Jiang, Qingfeng Wang and Jun Huang

{"title":"A combined modeling method for complex multi-fidelity data fusion","authors":"Lei Tang, Feng Liu, Anping Wu, Yubo Li, Wanqiu Jiang, Qingfeng Wang and Jun Huang","doi":"10.1088/2632-2153/ad718f","DOIUrl":"https://doi.org/10.1088/2632-2153/ad718f","url":null,"abstract":"Currently, mainstream methods for multi-fidelity data fusion have achieved great success in many fields, but they generally suffer from poor scalability. Therefore, this paper proposes a combination modeling method for complex multi-fidelity data fusion, devoted to solving the modeling problems with three types of multi-fidelity data fusion, and explores a general solution for any n types of multi-fidelity data fusion. Different from the traditional direct modeling method—Multi-Fidelity Deep Neural Network (MFDNN)—the method is an indirect modeling method. The experimental results on three representative benchmark functions and the prediction tasks of SG6043 airfoil aerodynamic performance show that combination modeling has the following advantages: (1) It can quickly establish the mapping relationship between high, medium, and low fidelity data. (2) It can effectively solve the data imbalance problem in multi-fidelity modeling. (3) Compared with MFDNN, it has stronger noise resistance and higher prediction accuracy. Additionally, this paper discusses the scalability problem of the method when n = 4 and n = 5, providing a reference for further research on the combined modeling method.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":"56 1","pages":""},"PeriodicalIF":6.8,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142197714","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0