{"title":"Using Deep Learning Techniques to Search for the MiniBooNE Low Energy Excess in MicroBooNE with > 3$sigma$ Sensitivity","authors":"J. Moon","doi":"10.2172/1767032","DOIUrl":"https://doi.org/10.2172/1767032","url":null,"abstract":"This thesis describes an analysis developed for the MicroBooNE experiment to investigate an anomalous excess of electron-like events observed in the MiniBooNE detector. The hypothesis investigated here is that the MiniBooNE anomaly represents appearance of electron neutrinos. Using an amalgam of novel Deep Learning and standard algorithmic techniques this analysis reconstructs and identifies a highly pure sample of charged current quasi-elastic muon neutrino and electron neutrino interactions. This thesis describes the steps in the analysis chain and provides data-to-simulation comparisons for each step that establish confidence in the final prediction. When interpreted in the context of a $nu e$ appearance like model, this analysis predicts a 3.2$sigma$ sensitivity to exclude a standard model fluctuation which would appear as a MiniBooNE like anomaly using $7times10^{20}$ protons on target of MicroBooNE Data.","PeriodicalId":308994,"journal":{"name":"arXiv: Data Analysis, Statistics and Probability","volume":"220 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133549453","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Mohammad N. Murshed, M. H. Chowdhury, Md Nazmul Islam Shuzan, Mohammad Monir Uddin
{"title":"Towards an Improved Eigensystem Realization Algorithm for Low-Error Guarantees","authors":"Mohammad N. Murshed, M. H. Chowdhury, Md Nazmul Islam Shuzan, Mohammad Monir Uddin","doi":"10.1007/978-981-16-0586-4_4","DOIUrl":"https://doi.org/10.1007/978-981-16-0586-4_4","url":null,"abstract":"","PeriodicalId":308994,"journal":{"name":"arXiv: Data Analysis, Statistics and Probability","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129047958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Measures of Spike Train Synchrony and Directionality","authors":"Eero Satuvuori, Irene Malvestio, T. Kreuz","doi":"10.1007/978-3-319-68297-6_13","DOIUrl":"https://doi.org/10.1007/978-3-319-68297-6_13","url":null,"abstract":"","PeriodicalId":308994,"journal":{"name":"arXiv: Data Analysis, Statistics and Probability","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115042469","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Practical Statistics for Particle Physics","authors":"R. Barlow","doi":"10.23730/CYRSP-2020-005.149","DOIUrl":"https://doi.org/10.23730/CYRSP-2020-005.149","url":null,"abstract":"This is the write-up of a set of lectures given at the Asia Europe Pacific School of High Energy Physics in Quy Nhon, Vietnam in September 2018, to an audience of PhD students in all branches of particle physics They cover the different meanings of 'probability', particularly frequentist and Bayesian, the binomial, Poisson and Gaussian distributions, hypothesis testing, estimation, errors (including asymmetric and systematic errors) and goodness of fit. Several different methods used in setting upper limits are explained, followed by a discussion on why 5 sigma are conventionally required for a 'discovery'.","PeriodicalId":308994,"journal":{"name":"arXiv: Data Analysis, Statistics and Probability","volume":"73 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127271143","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Data-Driven Computational Methods","authors":"J. Harlim","doi":"10.1017/9781108562461","DOIUrl":"https://doi.org/10.1017/9781108562461","url":null,"abstract":"Modern scientific computational methods are undergoing a transformative change; big data and statistical learning methods now have the potential to outperform the classical first-principles modeling paradigm. This book bridges this transition, connecting the theory of probability, stochastic processes, functional analysis, numerical analysis, and differential geometry. It describes two classes of computational methods to leverage data for modeling dynamical systems. The first is concerned with data fitting algorithms to estimate parameters in parametric models that are postulated on the basis of physical or dynamical laws. The second class is on operator estimation, which uses the data to nonparametrically approximate the operator generated by the transition function of the underlying dynamical systems. \u0000This self-contained book is suitable for graduate studies in applied mathematics, statistics, and engineering. Carefully chosen elementary examples with supplementary MATLAB codes and appendices covering the relevant prerequisite materials are provided, making it suitable for self-study.","PeriodicalId":308994,"journal":{"name":"arXiv: Data Analysis, Statistics and Probability","volume":"65 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127754571","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An approximation theoretic perspective of the Sobol' indices with dependent variables.","authors":"Joseph L. Hart, P. Gremaud","doi":"10.1615/int.j.uncertaintyquantification.2018026498","DOIUrl":"https://doi.org/10.1615/int.j.uncertaintyquantification.2018026498","url":null,"abstract":"The Sobol' indices are a recognized tool in global sensitivity analysis. When the uncertain variables in a model are statistically independent, the Sobol' indices may be easily interpreted and utilized. However, their interpretation and utility is more challenging with statistically dependent variables. This article develops an approximation theoretic perspective to interpret Sobol' indices in the presence of variable dependencies. The value of this perspective is demonstrated in the context of dimension reduction, a common application of the Sobol' indices. Theoretical analysis and illustrative examples are provided.","PeriodicalId":308994,"journal":{"name":"arXiv: Data Analysis, Statistics and Probability","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130329792","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The Role of Data Analysis in Uncertainty Quantification: Case Studies for Materials Modeling","authors":"P. Patrone, Anthony J. Kearsley, A. Dienstfrey","doi":"10.2514/6.2018-0927","DOIUrl":"https://doi.org/10.2514/6.2018-0927","url":null,"abstract":"In computational materials science, mechanical properties are typically extracted from simulations by means of analysis routines that seek to mimic their experimental counterparts. However, simulated data often exhibit uncertainties that can propagate into final predictions in unexpected ways. Thus, modelers require data analysis tools that (i) address the problems posed by simulated data, and (ii) facilitate uncertainty quantification. In this manuscript, we discuss three case studies in materials modeling where careful data analysis can be leveraged to address specific instances of these issues. As a unifying theme, we highlight the idea that attention to physical and mathematical constraints surrounding the generation of computational data can significantly enhance its analysis.","PeriodicalId":308994,"journal":{"name":"arXiv: Data Analysis, Statistics and Probability","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133281964","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The case for preserving our knowledge and data in physics experiments","authors":"F. Berghaus","doi":"10.3204/DESY-PROC-2017-02/BERGHAUS_FRANK","DOIUrl":"https://doi.org/10.3204/DESY-PROC-2017-02/BERGHAUS_FRANK","url":null,"abstract":"Berghaus, Frank \"The case for preserving our knowledge and data in physics experiments\" in Proceedings of the 13th \"Patras\" Workshop on Axions, WIMPs and WISPs, PATRAS 2017 / Maroudas, Marios (eds.), Verlag Deutsches Elektronen-Synchrotron : 2018 ; Patras 2017 : 13th Patras Workshop on Axions, WIMPs and WISPs, 2017-05-15 - 2017-05-19, Thessaloniki 13th Patras Workshop on Axions, WIMPs and WISPs, AXION-WIMP 2017, Thessaloniki, Greece, 15 May 2017 - 19 May 2017; Hamburg : Verlag Deutsches Elektronen-Synchrotron, DESY-PROC, 191-195 (2018). doi:10.3204/DESY-PROC-2017-02/berghaus_frank","PeriodicalId":308994,"journal":{"name":"arXiv: Data Analysis, Statistics and Probability","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121990868","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Machine and deep learning techniques in heavy-ion collisions with ALICE","authors":"R. Haake","doi":"10.22323/1.314.0498","DOIUrl":"https://doi.org/10.22323/1.314.0498","url":null,"abstract":"Over the last years, machine learning tools have been successfully applied to a wealth of problems in high-energy physics. A typical example is the classification of physics objects. Supervised machine learning methods allow for significant improvements in classification problems by taking into account observable correlations and by learning the optimal selection from examples, e.g. from Monte Carlo simulations. Even more promising is the usage of deep learning techniques. Methods like deep convolutional networks might be able to catch features from low-level parameters that are not exploited by default cut-based methods. \u0000These ideas could be particularly beneficial for measurements in heavy-ion collisions, because of the very large multiplicities. Indeed, machine learning methods potentially perform much better in systems with a large number of degrees of freedom compared to cut-based methods. Moreover, many key heavy-ion observables are most interesting at low transverse momentum where the underlying event is dominant and the signal-to-noise ratio is quite low. \u0000In this work, recent developments of machine- and deep learning applications in heavy-ion collisions with ALICE will be presented, with focus on a deep learning-based b-jet tagging approach and the measurement of low-mass dielectrons. While the b-jet tagger is based on a mixture of shallow fully-connected and deep convolutional networks, the low-mass dielectron measurement uses gradient boosting and shallow neural networks. Both methods are very promising compared to default cut-based methods.","PeriodicalId":308994,"journal":{"name":"arXiv: Data Analysis, Statistics and Probability","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126691569","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Combination of various data analysis techniques for efficient track reconstruction in very high multiplicity events","authors":"F. Siklér","doi":"10.1051/EPJCONF/201715000011","DOIUrl":"https://doi.org/10.1051/EPJCONF/201715000011","url":null,"abstract":"A novel combination of established data analysis techniques for reconstructing charged-particles in high energy collisions is proposed. It uses all information available in a collision event while keeping competing choices open as long as possible. Suitable track candidates are selected by transforming measured hits to a binned, three- or four-dimensional, track parameter space. It is accomplished by the use of templates taking advantage of the translational and rotational symmetries of the detectors. Track candidates and their corresponding hits, the nodes, form a usually highly connected network, a bipartite graph, where we allow for multiple hit to track assignments, edges. In order to get a manageable problem, the graph is cut into very many minigraphs by removing a few of its vulnerable components, edges and nodes. Finally the hits are distributed among the track candidates by exploring a deterministic decision tree. A depth-limited search is performed maximizing the number of hits on tracks, and minimizing the sum of track-fit χ 2 . Simplified but realistic models of LHC silicon trackers including the relevant physics processes are used to test and study the performance (efficiency, purity, timing) of the proposed method in the case of single or many simultaneous proton-proton collisions (high pileup), and for single heavy-ion collisions at the highest available energies.","PeriodicalId":308994,"journal":{"name":"arXiv: Data Analysis, Statistics and Probability","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133646462","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}