{"title":"The complexity of boolean failure identification","authors":"Nicola Galesi , Fariba Ranjbar","doi":"10.1016/j.tcs.2025.115536","DOIUrl":null,"url":null,"abstract":"<div><div>We consider the problem of identifying failure nodes in networks under the <em>Boolean Network Tomography</em> (<span><math><mi>BNT</mi></math></span>) approach, which is based on end-to-end measurements routed in a network along paths and producing a boolean (failure/not-failure) outcome. Such end-to-end measurements paths are usually described by an incidence boolean matrix <span><math><mi>M</mi></math></span> with <span><math><mi>m</mi></math></span> rows (the measurements paths) and <span><math><mi>n</mi></math></span> columns (the nodes of the network). A key notion used in practice in this approach is that of <span><math><mi>k</mi></math></span>-<em>identifiability</em>. Loosely speaking, a set of <span><math><mi>m</mi></math></span> boolean measurements paths over <span><math><mi>n</mi></math></span> nodes is <span><math><mi>k</mi></math></span>-identifiable, where <span><math><mi>k</mi></math></span> is a non-negative integer, if, whenever there are fewer than <span><math><mrow><mi>k</mi><mo>+</mo><mn>1</mn></mrow></math></span> failures, it is always possible to identify unambiguously and uniquely which nodes are failing.</div><div>Following the focus of some recent results analyzing maximal identifiability from a theoretical point of view, this work establishes the complexity of the optimization problem that determines the <em>maximal</em> <span><math><mi>k</mi></math></span> for which a set of measurement paths is <span><math><mi>k</mi></math></span>-identifiable (<span><math><mi>MID</mi></math></span>). We prove that such problem is <span><math><mi>NP</mi></math></span>-hard by a reduction from the <em>Minimum Hitting Set</em> problem and we prove that its decision version is in <span><math><mi>NP</mi></math></span>.</div><div>We further consider the following extremal combinatoric question, which is also of practical relevance: given the number <span><math><mi>n</mi></math></span> of nodes of the network and a non-negative integer value <span><math><mi>k</mi></math></span> for the identifiability, what is the minimal number <span><math><mi>m</mi></math></span> of measurement paths over the <span><math><mi>n</mi></math></span> nodes to consider in such a way that the maximal identifiability is at least <span><math><mi>k</mi></math></span>? A folklore result shows that to have maximal identifiability at least 1, then <span><math><mrow><mi>m</mi><mo>≥</mo><mi>log</mi><mo>(</mo><mi>n</mi><mo>+</mo><mn>1</mn><mo>)</mo></mrow></math></span> (or, equivalently, that if <span><math><mrow><mi>n</mi><mo>></mo><msup><mn>2</mn><mi>m</mi></msup><mo>−</mo><mn>1</mn></mrow></math></span>, then the maximal identifiability is less than or equal 0). In this work we answer this question for each <span><math><mrow><mi>n</mi><mo>∈</mo><mi>N</mi></mrow></math></span> and for each <span><math><mrow><mi>k</mi><mo>≥</mo><mn>2</mn></mrow></math></span>, proving that, there is constant <span><math><mi>C</mi></math></span> such that if <span><math><mrow><mi>n</mi><mo>></mo><mi>C</mi><msup><mi>m</mi><mrow><mn>1</mn><mo>+</mo><mfrac><mi>m</mi><mrow><mi>k</mi><mo>−</mo><mn>1</mn></mrow></mfrac></mrow></msup></mrow></math></span>, then the maximal identifiability value is strictly smaller than <span><math><mi>k</mi></math></span> (and when <span><math><mrow><mi>k</mi><mo>=</mo><mn>2</mn></mrow></math></span>, <span><math><mrow><mi>n</mi><mo>></mo><mi>C</mi><msup><mi>m</mi><mi>m</mi></msup></mrow></math></span> suffices).</div><div>Finally, we study upper and lower bounds on the number of unambiguously identifiable nodes, introducing new identifiability conditions which strictly imply and are strictly implied by unambiguous identifiability. We use these new conditions to design algorithmic heuristics to count defective nodes in a fine-grained way. In particular we introduce a random model to study lower bounds on the number of unambiguously identifiable defective nodes and we use this model to estimate lower bounds on the number of identifiable nodes on real networks by a maximum likelihood estimate approach.</div></div>","PeriodicalId":49438,"journal":{"name":"Theoretical Computer Science","volume":"1056 ","pages":"Article 115536"},"PeriodicalIF":1.0000,"publicationDate":"2025-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Theoretical Computer Science","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0304397525004748","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
We consider the problem of identifying failure nodes in networks under the Boolean Network Tomography () approach, which is based on end-to-end measurements routed in a network along paths and producing a boolean (failure/not-failure) outcome. Such end-to-end measurements paths are usually described by an incidence boolean matrix with rows (the measurements paths) and columns (the nodes of the network). A key notion used in practice in this approach is that of -identifiability. Loosely speaking, a set of boolean measurements paths over nodes is -identifiable, where is a non-negative integer, if, whenever there are fewer than failures, it is always possible to identify unambiguously and uniquely which nodes are failing.
Following the focus of some recent results analyzing maximal identifiability from a theoretical point of view, this work establishes the complexity of the optimization problem that determines the maximal for which a set of measurement paths is -identifiable (). We prove that such problem is -hard by a reduction from the Minimum Hitting Set problem and we prove that its decision version is in .
We further consider the following extremal combinatoric question, which is also of practical relevance: given the number of nodes of the network and a non-negative integer value for the identifiability, what is the minimal number of measurement paths over the nodes to consider in such a way that the maximal identifiability is at least ? A folklore result shows that to have maximal identifiability at least 1, then (or, equivalently, that if , then the maximal identifiability is less than or equal 0). In this work we answer this question for each and for each , proving that, there is constant such that if , then the maximal identifiability value is strictly smaller than (and when , suffices).
Finally, we study upper and lower bounds on the number of unambiguously identifiable nodes, introducing new identifiability conditions which strictly imply and are strictly implied by unambiguous identifiability. We use these new conditions to design algorithmic heuristics to count defective nodes in a fine-grained way. In particular we introduce a random model to study lower bounds on the number of unambiguously identifiable defective nodes and we use this model to estimate lower bounds on the number of identifiable nodes on real networks by a maximum likelihood estimate approach.
期刊介绍:
Theoretical Computer Science is mathematical and abstract in spirit, but it derives its motivation from practical and everyday computation. Its aim is to understand the nature of computation and, as a consequence of this understanding, provide more efficient methodologies. All papers introducing or studying mathematical, logic and formal concepts and methods are welcome, provided that their motivation is clearly drawn from the field of computing.