A detailed study of interpretability of deep neural network based top taggers

IF 6.3 2区物理与天体物理 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Machine Learning Science and Technology Pub Date : 2022-10-09 DOI:10.1088/2632-2153/ace0a1

Ayush Khot, M. Neubauer, Avik Roy

{"title":"A detailed study of interpretability of deep neural network based top taggers","authors":"Ayush Khot, M. Neubauer, Avik Roy","doi":"10.1088/2632-2153/ace0a1","DOIUrl":null,"url":null,"abstract":"Recent developments in the methods of explainable artificial intelligence (XAI) allow researchers to explore the inner workings of deep neural networks (DNNs), revealing crucial information about input–output relationships and realizing how data connects with machine learning models. In this paper we explore interpretability of DNN models designed to identify jets coming from top quark decay in high energy proton–proton collisions at the Large Hadron Collider. We review a subset of existing top tagger models and explore different quantitative methods to identify which features play the most important roles in identifying the top jets. We also investigate how and why feature importance varies across different XAI metrics, how correlations among features impact their explainability, and how latent space representations encode information as well as correlate with physically meaningful quantities. Our studies uncover some major pitfalls of existing XAI methods and illustrate how they can be overcome to obtain consistent and meaningful interpretation of these models. We additionally illustrate the activity of hidden layers as neural activation pattern diagrams and demonstrate how they can be used to understand how DNNs relay information across the layers and how this understanding can help to make such models significantly simpler by allowing effective model reoptimization and hyperparameter tuning. These studies not only facilitate a methodological approach to interpreting models but also unveil new insights about what these models learn. Incorporating these observations into augmented model design, we propose the particle flow interaction network model and demonstrate how interpretability-inspired model augmentation can improve top tagging performance.","PeriodicalId":33757,"journal":{"name":"Machine Learning Science and Technology","volume":" ","pages":""},"PeriodicalIF":6.3000,"publicationDate":"2022-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Machine Learning Science and Technology","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.1088/2632-2153/ace0a1","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 7

Abstract

Recent developments in the methods of explainable artificial intelligence (XAI) allow researchers to explore the inner workings of deep neural networks (DNNs), revealing crucial information about input–output relationships and realizing how data connects with machine learning models. In this paper we explore interpretability of DNN models designed to identify jets coming from top quark decay in high energy proton–proton collisions at the Large Hadron Collider. We review a subset of existing top tagger models and explore different quantitative methods to identify which features play the most important roles in identifying the top jets. We also investigate how and why feature importance varies across different XAI metrics, how correlations among features impact their explainability, and how latent space representations encode information as well as correlate with physically meaningful quantities. Our studies uncover some major pitfalls of existing XAI methods and illustrate how they can be overcome to obtain consistent and meaningful interpretation of these models. We additionally illustrate the activity of hidden layers as neural activation pattern diagrams and demonstrate how they can be used to understand how DNNs relay information across the layers and how this understanding can help to make such models significantly simpler by allowing effective model reoptimization and hyperparameter tuning. These studies not only facilitate a methodological approach to interpreting models but also unveil new insights about what these models learn. Incorporating these observations into augmented model design, we propose the particle flow interaction network model and demonstrate how interpretability-inspired model augmentation can improve top tagging performance.

查看原文本刊更多论文

基于深度神经网络的顶部标签可解释性的详细研究

可解释人工智能(XAI)方法的最新发展使研究人员能够探索深度神经网络(dnn)的内部工作原理，揭示有关输入输出关系的关键信息，并实现数据如何与机器学习模型连接。在本文中，我们探讨了DNN模型的可解释性，该模型设计用于识别大型强子对撞机高能质子-质子碰撞中顶夸克衰变产生的射流。我们回顾了现有的顶部标记器模型的子集，并探索了不同的定量方法来确定哪些特征在识别顶部射流中起着最重要的作用。我们还研究了特征重要性在不同的XAI度量中如何以及为什么变化，特征之间的相关性如何影响其可解释性，以及潜在空间表示如何编码信息以及如何与物理上有意义的量相关。我们的研究揭示了现有XAI方法的一些主要缺陷，并说明了如何克服这些缺陷，以获得对这些模型的一致和有意义的解释。我们还说明了隐藏层的活动作为神经激活模式图，并演示了如何使用它们来理解dnn如何跨层传递信息，以及这种理解如何通过允许有效的模型重新优化和超参数调优来帮助使这些模型显着简化。这些研究不仅促进了解释模型的方法论方法，而且揭示了这些模型学习的新见解。将这些观察结果结合到增强模型设计中，我们提出了粒子流相互作用网络模型，并演示了受可解释性启发的模型增强如何提高顶部标记性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Machine Learning Science and Technology Computer Science-Artificial Intelligence

CiteScore

9.10

自引率

4.40%

发文量

审稿时长

5 weeks

期刊介绍： Machine Learning Science and Technology is a multidisciplinary open access journal that bridges the application of machine learning across the sciences with advances in machine learning methods and theory as motivated by physical insights. Specifically, articles must fall into one of the following categories: advance the state of machine learning-driven applications in the sciences or make conceptual, methodological or theoretical advances in machine learning with applications to, inspiration from, or motivated by scientific problems.