Sascha Caron, Nadezhda Dobreva, Antonio Ferrer Sánchez, José D. Martín-Guerrero, Uraz Odyurt, Roberto Ruiz de Austri Bazan, Zef Wolffs, Yue Zhao
Trackformers: in search of transformer-based particle tracking for the high-luminosity LHC era
The European Physical Journal C, 85 (4), published 2025-04-25. DOI: 10.1140/epjc/s10052-025-14156-3
Article: https://link.springer.com/article/10.1140/epjc/s10052-025-14156-3
Citations: 0
Abstract
High-Energy Physics experiments are facing a multi-fold data increase with every new iteration. This is certainly the case for the upcoming High-Luminosity LHC upgrade. Such increased data processing requirements force revisions to almost every step of the data processing pipeline. One such step in need of an overhaul is the task of particle track reconstruction, a.k.a. tracking. A Machine Learning-assisted solution is expected to provide significant improvements, since the most time-consuming step in tracking is the assignment of hits to particles or track candidates. This is the topic of this paper. We take inspiration from large language models. As such, we consider two approaches: the prediction of the next word in a sentence (next hit point in a track), as well as the one-shot prediction of all hits within an event. In an extensive design effort, we have experimented with three models based on the Transformer architecture and one model based on the U-Net architecture, performing track association predictions for collision event hit points. In our evaluation, we consider a spectrum of simple to complex representations of the problem, eliminating designs with lower metrics early on. We report extensive results, covering both prediction accuracy (score) and computational performance. We have made use of the REDVID simulation framework, as well as reductions applied to the TrackML data set, to compose five data sets from simple to complex, for our experiments. The results highlight distinct advantages among different designs in terms of prediction accuracy and computational performance, demonstrating the efficiency of our methodology. Most importantly, the results show the viability of a one-shot encoder-classifier-based Transformer solution as a practical approach for the task of tracking.
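The two problem framings named in the abstract, next-hit prediction and one-shot assignment of all hits in an event, can be contrasted on a toy event. The sketch below is only an illustration and not the authors' models: the straight-line tracks, the concentric layers, and the functions `make_event`, `one_shot_labels`, and `follow_track` are all hypothetical, and simple geometric rules (azimuthal binning, nearest angle) stand in for the trained Transformer or U-Net predictors.

```python
import math

def make_event(track_angles, layer_radii):
    """Toy event (assumption): straight tracks from the origin crossing circular layers."""
    hits = []
    for track_id, phi in enumerate(track_angles):
        for layer, r in enumerate(layer_radii):
            hits.append({"x": r * math.cos(phi), "y": r * math.sin(phi),
                         "layer": layer, "track": track_id})
    return hits

def azimuth(hit):
    """Azimuthal angle of a hit, mapped to [0, 2*pi)."""
    return math.atan2(hit["y"], hit["x"]) % (2 * math.pi)

def angle_gap(a, b):
    """Smallest absolute difference between two angles."""
    d = abs(a - b) % (2 * math.pi)
    return min(d, 2 * math.pi - d)

def one_shot_labels(hits, n_bins=16):
    """One-shot framing: every hit in the event gets a class label in a single pass.
    Placeholder for a learned classifier: the class is an azimuthal bin."""
    return [min(int(azimuth(h) / (2 * math.pi) * n_bins), n_bins - 1) for h in hits]

def follow_track(seed, hits, n_layers):
    """Next-hit framing: grow one track hit by hit, as a language model grows a sentence.
    Placeholder for a learned next-hit predictor: pick the nearest hit in angle
    on the following layer."""
    track = [seed]
    for layer in range(seed["layer"] + 1, n_layers):
        candidates = [h for h in hits if h["layer"] == layer]
        last_phi = azimuth(track[-1])
        track.append(min(candidates, key=lambda h: angle_gap(azimuth(h), last_phi)))
    return track

# Three well-separated tracks crossing three layers.
event = make_event([0.3, 1.5, 4.0], [1.0, 2.0, 3.0])
labels = one_shot_labels(event)                       # one label per hit, all at once
seed = next(h for h in event if h["track"] == 1 and h["layer"] == 0)
followed = follow_track(seed, event, n_layers=3)      # one track, built sequentially
```

The one-shot variant processes the whole event in one call, while the next-hit variant needs one step per layer and per track, which is the kind of trade-off between accuracy and computational performance the abstract refers to.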
About the journal:
Experimental Physics I: Accelerator Based High-Energy Physics
Hadron and lepton collider physics
Lepton-nucleon scattering
High-energy nuclear reactions
Standard model precision tests
Search for new physics beyond the standard model
Heavy flavour physics
Neutrino properties
Particle detector developments
Computational methods and analysis tools
Experimental Physics II: Astroparticle Physics
Dark matter searches
High-energy cosmic rays
Double beta decay
Long baseline neutrino experiments
Neutrino astronomy
Axions and other weakly interacting light particles
Gravitational waves and observational cosmology
Particle detector developments
Computational methods and analysis tools
Theoretical Physics I: Phenomenology of the Standard Model and Beyond
Electroweak interactions
Quantum chromodynamics
Heavy quark physics and quark flavour mixing
Neutrino physics
Phenomenology of astro- and cosmoparticle physics
Meson spectroscopy and non-perturbative QCD
Low-energy effective field theories
Lattice field theory
High temperature QCD and heavy ion physics
Phenomenology of supersymmetric extensions of the SM
Phenomenology of non-supersymmetric extensions of the SM
Model building and alternative models of electroweak symmetry breaking
Flavour physics beyond the SM
Computational algorithms and tools...etc.