Dianlong You , Zexuan Li , Jiawei Shen , Zhao Yu , Shunfu Jin , Xindong Wu
{"title":"变分自编码器中具有因果效应传递的解纠缠表征学习","authors":"Dianlong You , Zexuan Li , Jiawei Shen , Zhao Yu , Shunfu Jin , Xindong Wu","doi":"10.1016/j.patcog.2025.112018","DOIUrl":null,"url":null,"abstract":"<div><div>Disentangled Representation Learning in variational autoencoder (VAE) has emerged as a strategy to identify and disentangle underlying factors from observable data to improve recognition capabilities such as images, speeches, and biological signals. Existing disentanglement methods are mostly based on the prior assumption that latent variables are mutually independent, which is inconsistent with reality and fails to transmit causal effects among causal nodes. To address the above issues, we introduce a novel disentanglement representation learning model with causal effect transmission, named DRL<span><math><msub><mrow></mrow><mrow><mi>CET</mi></mrow></msub></math></span>. The main ideas of DRL<span><math><msub><mrow></mrow><mrow><mi>CET</mi></mrow></msub></math></span> involve (1) mapping encoded latent exogenous variables to causal variables and updating the causal structure by a constructed nonlinear/linear structural causal model (SCM), (2) designing hierarchical feature loss from discriminator to replace pixel-level loss in variational autoencoder for efficiently extracting causal features, and (3) aggregating causal information from adjacent nodes by a graph attention network (GAT) with intervention for transmitting causal effects. Extensive theoretical analyses and empirical studies on synthetic and real datasets demonstrate the effectiveness, viability, and superiority of our DRL<span><math><msub><mrow></mrow><mrow><mi>CET</mi></mrow></msub></math></span> over the state-of-the-arts. Our code is publicly available at <span><span>https://github.com/youdianlong/DRLCET.git</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"170 ","pages":"Article 112018"},"PeriodicalIF":7.6000,"publicationDate":"2025-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Disentangled representation learning with causal effect transmission in variational autoencoder\",\"authors\":\"Dianlong You , Zexuan Li , Jiawei Shen , Zhao Yu , Shunfu Jin , Xindong Wu\",\"doi\":\"10.1016/j.patcog.2025.112018\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Disentangled Representation Learning in variational autoencoder (VAE) has emerged as a strategy to identify and disentangle underlying factors from observable data to improve recognition capabilities such as images, speeches, and biological signals. Existing disentanglement methods are mostly based on the prior assumption that latent variables are mutually independent, which is inconsistent with reality and fails to transmit causal effects among causal nodes. To address the above issues, we introduce a novel disentanglement representation learning model with causal effect transmission, named DRL<span><math><msub><mrow></mrow><mrow><mi>CET</mi></mrow></msub></math></span>. The main ideas of DRL<span><math><msub><mrow></mrow><mrow><mi>CET</mi></mrow></msub></math></span> involve (1) mapping encoded latent exogenous variables to causal variables and updating the causal structure by a constructed nonlinear/linear structural causal model (SCM), (2) designing hierarchical feature loss from discriminator to replace pixel-level loss in variational autoencoder for efficiently extracting causal features, and (3) aggregating causal information from adjacent nodes by a graph attention network (GAT) with intervention for transmitting causal effects. Extensive theoretical analyses and empirical studies on synthetic and real datasets demonstrate the effectiveness, viability, and superiority of our DRL<span><math><msub><mrow></mrow><mrow><mi>CET</mi></mrow></msub></math></span> over the state-of-the-arts. Our code is publicly available at <span><span>https://github.com/youdianlong/DRLCET.git</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":49713,\"journal\":{\"name\":\"Pattern Recognition\",\"volume\":\"170 \",\"pages\":\"Article 112018\"},\"PeriodicalIF\":7.6000,\"publicationDate\":\"2025-06-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pattern Recognition\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0031320325006788\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320325006788","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Disentangled representation learning with causal effect transmission in variational autoencoder
Disentangled Representation Learning in variational autoencoder (VAE) has emerged as a strategy to identify and disentangle underlying factors from observable data to improve recognition capabilities such as images, speeches, and biological signals. Existing disentanglement methods are mostly based on the prior assumption that latent variables are mutually independent, which is inconsistent with reality and fails to transmit causal effects among causal nodes. To address the above issues, we introduce a novel disentanglement representation learning model with causal effect transmission, named DRL. The main ideas of DRL involve (1) mapping encoded latent exogenous variables to causal variables and updating the causal structure by a constructed nonlinear/linear structural causal model (SCM), (2) designing hierarchical feature loss from discriminator to replace pixel-level loss in variational autoencoder for efficiently extracting causal features, and (3) aggregating causal information from adjacent nodes by a graph attention network (GAT) with intervention for transmitting causal effects. Extensive theoretical analyses and empirical studies on synthetic and real datasets demonstrate the effectiveness, viability, and superiority of our DRL over the state-of-the-arts. Our code is publicly available at https://github.com/youdianlong/DRLCET.git.
期刊介绍:
The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.