{"title":"通过估算单变量和多变量时间序列之间的转移熵识别交互网络中的影响节点和脆弱节点","authors":"Julian Lee","doi":"arxiv-2408.15811","DOIUrl":null,"url":null,"abstract":"Transfer entropy (TE) is a powerful tool for measuring causal relationships\nwithin interaction networks. Traditionally, TE and its conditional variants are\napplied pairwise between dynamic variables to infer these causal relationships.\nHowever, identifying the most influential or vulnerable node in a system\nrequires measuring the causal influence of each component on the entire system\nand vice versa. In this paper, I propose using outgoing and incoming transfer\nentropy-where outgoing TE quantifies the influence of a node on the rest of the\nsystem, and incoming TE measures the influence of the rest of the system on the\nnode. The node with the highest outgoing TE is identified as the most\ninfluential, or \"hub\", while the node with the highest incoming TE is the most\nvulnerable, or \"anti-hub\". Since these measures involve transfer entropy\nbetween univariate and multivariate time series, naive estimation methods can\nresult in significant errors, particularly when the number of variables is\ncomparable to or exceeds the number of samples. To address this, I introduce a\nnovel estimation scheme that computes outgoing and incoming TE only between\nsignificantly interacting partners. The feasibility of this approach is\ndemonstrated by using synthetic data, and by applying it to a real data of oral\nmicrobiota. The method successfully identifies the bacterial species known to\nbe key players in the bacterial community, demonstrating the power of the new\nmethod.","PeriodicalId":501040,"journal":{"name":"arXiv - PHYS - Biological Physics","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Identifying Influential and Vulnerable Nodes in Interaction Networks through Estimation of Transfer Entropy Between Univariate and Multivariate Time Series\",\"authors\":\"Julian Lee\",\"doi\":\"arxiv-2408.15811\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Transfer entropy (TE) is a powerful tool for measuring causal relationships\\nwithin interaction networks. Traditionally, TE and its conditional variants are\\napplied pairwise between dynamic variables to infer these causal relationships.\\nHowever, identifying the most influential or vulnerable node in a system\\nrequires measuring the causal influence of each component on the entire system\\nand vice versa. In this paper, I propose using outgoing and incoming transfer\\nentropy-where outgoing TE quantifies the influence of a node on the rest of the\\nsystem, and incoming TE measures the influence of the rest of the system on the\\nnode. The node with the highest outgoing TE is identified as the most\\ninfluential, or \\\"hub\\\", while the node with the highest incoming TE is the most\\nvulnerable, or \\\"anti-hub\\\". Since these measures involve transfer entropy\\nbetween univariate and multivariate time series, naive estimation methods can\\nresult in significant errors, particularly when the number of variables is\\ncomparable to or exceeds the number of samples. To address this, I introduce a\\nnovel estimation scheme that computes outgoing and incoming TE only between\\nsignificantly interacting partners. The feasibility of this approach is\\ndemonstrated by using synthetic data, and by applying it to a real data of oral\\nmicrobiota. The method successfully identifies the bacterial species known to\\nbe key players in the bacterial community, demonstrating the power of the new\\nmethod.\",\"PeriodicalId\":501040,\"journal\":{\"name\":\"arXiv - PHYS - Biological Physics\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - PHYS - Biological Physics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2408.15811\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - PHYS - Biological Physics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.15811","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Identifying Influential and Vulnerable Nodes in Interaction Networks through Estimation of Transfer Entropy Between Univariate and Multivariate Time Series
Transfer entropy (TE) is a powerful tool for measuring causal relationships
within interaction networks. Traditionally, TE and its conditional variants are
applied pairwise between dynamic variables to infer these causal relationships.
However, identifying the most influential or vulnerable node in a system
requires measuring the causal influence of each component on the entire system
and vice versa. In this paper, I propose using outgoing and incoming transfer
entropy-where outgoing TE quantifies the influence of a node on the rest of the
system, and incoming TE measures the influence of the rest of the system on the
node. The node with the highest outgoing TE is identified as the most
influential, or "hub", while the node with the highest incoming TE is the most
vulnerable, or "anti-hub". Since these measures involve transfer entropy
between univariate and multivariate time series, naive estimation methods can
result in significant errors, particularly when the number of variables is
comparable to or exceeds the number of samples. To address this, I introduce a
novel estimation scheme that computes outgoing and incoming TE only between
significantly interacting partners. The feasibility of this approach is
demonstrated by using synthetic data, and by applying it to a real data of oral
microbiota. The method successfully identifies the bacterial species known to
be key players in the bacterial community, demonstrating the power of the new
method.