Trackformers: in search of transformer-based particle tracking for the high-luminosity LHC era

IF 4.2 2区物理与天体物理 Q2 PHYSICS, PARTICLES & FIELDS

The European Physical Journal C Pub Date : 2025-04-25 DOI:10.1140/epjc/s10052-025-14156-3

Sascha Caron, Nadezhda Dobreva, Antonio Ferrer Sánchez, José D. Martín-Guerrero, Uraz Odyurt, Roberto Ruiz de Austri Bazan, Zef Wolffs, Yue Zhao

{"title":"Trackformers: in search of transformer-based particle tracking for the high-luminosity LHC era","authors":"Sascha Caron, Nadezhda Dobreva, Antonio Ferrer Sánchez, José D. Martín-Guerrero, Uraz Odyurt, Roberto Ruiz de Austri Bazan, Zef Wolffs, Yue Zhao","doi":"10.1140/epjc/s10052-025-14156-3","DOIUrl":null,"url":null,"abstract":"<div><p>High-Energy Physics experiments are facing a multi-fold data increase with every new iteration. This is certainly the case for the upcoming High-Luminosity LHC upgrade. Such increased data processing requirements forces revisions to almost every step of the data processing pipeline. One such step in need of an overhaul is the task of particle track reconstruction, a.k.a., <i>tracking</i>. A Machine Learning-assisted solution is expected to provide significant improvements, since the most time-consuming step in tracking is the assignment of hits to particles or track candidates. This is the topic of this paper. We take inspiration from large language models. As such, we consider two approaches: the prediction of the next word in a sentence (next hit point in a track), as well as the one-shot prediction of all hits within an event. In an extensive design effort, we have experimented with three models based on the Transformer architecture and one model based on the U-Net architecture, performing track association predictions for collision event hit points. In our evaluation, we consider a spectrum of simple to complex representations of the problem, eliminating designs with lower metrics early on. We report extensive results, covering both prediction accuracy (score) and computational performance. We have made use of the REDVID simulation framework, as well as reductions applied to the TrackML data set, to compose five data sets from simple to complex, for our experiments. The results highlight distinct advantages among different designs in terms of prediction accuracy and computational performance, demonstrating the efficiency of our methodology. Most importantly, the results show the viability of a one-shot encoder-classifier based Transformer solution as a practical approach for the task of tracking. \n</p></div>","PeriodicalId":788,"journal":{"name":"The European Physical Journal C","volume":"85 4","pages":""},"PeriodicalIF":4.2000,"publicationDate":"2025-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1140/epjc/s10052-025-14156-3.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The European Physical Journal C","FirstCategoryId":"4","ListUrlMain":"https://link.springer.com/article/10.1140/epjc/s10052-025-14156-3","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PHYSICS, PARTICLES & FIELDS","Score":null,"Total":0}

引用次数: 0

Abstract

High-Energy Physics experiments are facing a multi-fold data increase with every new iteration. This is certainly the case for the upcoming High-Luminosity LHC upgrade. Such increased data processing requirements forces revisions to almost every step of the data processing pipeline. One such step in need of an overhaul is the task of particle track reconstruction, a.k.a., tracking. A Machine Learning-assisted solution is expected to provide significant improvements, since the most time-consuming step in tracking is the assignment of hits to particles or track candidates. This is the topic of this paper. We take inspiration from large language models. As such, we consider two approaches: the prediction of the next word in a sentence (next hit point in a track), as well as the one-shot prediction of all hits within an event. In an extensive design effort, we have experimented with three models based on the Transformer architecture and one model based on the U-Net architecture, performing track association predictions for collision event hit points. In our evaluation, we consider a spectrum of simple to complex representations of the problem, eliminating designs with lower metrics early on. We report extensive results, covering both prediction accuracy (score) and computational performance. We have made use of the REDVID simulation framework, as well as reductions applied to the TrackML data set, to compose five data sets from simple to complex, for our experiments. The results highlight distinct advantages among different designs in terms of prediction accuracy and computational performance, demonstrating the efficiency of our methodology. Most importantly, the results show the viability of a one-shot encoder-classifier based Transformer solution as a practical approach for the task of tracking.

查看原文本刊更多论文

轨迹变换器：为高亮度大型强子对撞机时代寻找基于变换器的粒子轨迹

高能物理实验每迭代一次，数据量都会成倍增加。即将到来的高亮度大型强子对撞机升级当然也是如此。这种数据处理要求的提高迫使数据处理管道的几乎每一步都要进行修改。其中一个需要彻底改革的步骤就是粒子轨迹重建任务，也就是跟踪。机器学习辅助解决方案有望带来重大改进，因为跟踪过程中最耗时的步骤是为粒子或候选轨迹分配命中率。这就是本文的主题。我们从大型语言模型中汲取灵感。因此，我们考虑了两种方法：预测句子中的下一个单词（轨迹中的下一个命中点），以及一次性预测事件中的所有命中点。在广泛的设计工作中，我们尝试了基于 Transformer 架构的三个模型和基于 U-Net 架构的一个模型，对碰撞事件的命中点进行轨道关联预测。在评估过程中，我们考虑了从简单到复杂的各种问题表示，并在早期淘汰了指标较低的设计。我们报告了大量结果，包括预测准确性（得分）和计算性能。我们利用 REDVID 仿真框架以及对 TrackML 数据集的还原，组成了从简单到复杂的五个数据集，用于我们的实验。结果凸显了不同设计在预测准确性和计算性能方面的明显优势，证明了我们方法的高效性。最重要的是，实验结果表明，基于单次编码器-分类器的 Transformer 解决方案是跟踪任务的一种实用方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

The European Physical Journal C 物理-物理：粒子与场物理

CiteScore

8.10

自引率

15.90%

发文量

1008

审稿时长

2-4 weeks

期刊介绍： Experimental Physics I: Accelerator Based High-Energy Physics Hadron and lepton collider physics Lepton-nucleon scattering High-energy nuclear reactions Standard model precision tests Search for new physics beyond the standard model Heavy flavour physics Neutrino properties Particle detector developments Computational methods and analysis tools Experimental Physics II: Astroparticle Physics Dark matter searches High-energy cosmic rays Double beta decay Long baseline neutrino experiments Neutrino astronomy Axions and other weakly interacting light particles Gravitational waves and observational cosmology Particle detector developments Computational methods and analysis tools Theoretical Physics I: Phenomenology of the Standard Model and Beyond Electroweak interactions Quantum chromo dynamics Heavy quark physics and quark flavour mixing Neutrino physics Phenomenology of astro- and cosmoparticle physics Meson spectroscopy and non-perturbative QCD Low-energy effective field theories Lattice field theory High temperature QCD and heavy ion physics Phenomenology of supersymmetric extensions of the SM Phenomenology of non-supersymmetric extensions of the SM Model building and alternative models of electroweak symmetry breaking Flavour physics beyond the SM Computational algorithms and tools...etc.