{"title":"学习视觉抽象推理的多样化表征","authors":"Kai Zhao, Yao Zhu, Bailu Si","doi":"10.1007/s10462-025-11372-x","DOIUrl":null,"url":null,"abstract":"<div><p>Learning effective representations suitable for decision making in high-level cognitive space is crucial for visual abstract reasoning tasks. The visual system of the mammalian brain is organized into parallel networks that can be roughly classified in dichotomy as the dorsal and ventral streams. How do parallel networks learn efficient representations for cognitive tasks is still an elusive question. We propose the Information Competition Learning Network (ICNet) within a mutual information-constrained framework to learn diversified representations for visual abstract reasoning tasks. ICNet comprises a representation learning module and a rule extractor module. The representation learning module learns two complementary sets of representation under different constraints. These two sets compete to prevent from learning what the other has learned, thereby minimizing mutual predictability. Subsequently, these sets are combined synergistically and relayed to the rule extractor module, where discrete abstract rules are formed to predict the correct option. Empirical experiments consistently show that ICNet achieves superior results across several visual abstract reasoning datasets. Additionally, in Out-of-Distribution relationship reasoning benchmarks, ICNet demonstrates robust generalization ability.</p></div>","PeriodicalId":8449,"journal":{"name":"Artificial Intelligence Review","volume":"58 12","pages":""},"PeriodicalIF":13.9000,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10462-025-11372-x.pdf","citationCount":"0","resultStr":"{\"title\":\"Learning diversified representations for visual abstract reasoning\",\"authors\":\"Kai Zhao, Yao Zhu, Bailu Si\",\"doi\":\"10.1007/s10462-025-11372-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Learning effective representations suitable for decision making in high-level cognitive space is crucial for visual abstract reasoning tasks. The visual system of the mammalian brain is organized into parallel networks that can be roughly classified in dichotomy as the dorsal and ventral streams. How do parallel networks learn efficient representations for cognitive tasks is still an elusive question. We propose the Information Competition Learning Network (ICNet) within a mutual information-constrained framework to learn diversified representations for visual abstract reasoning tasks. ICNet comprises a representation learning module and a rule extractor module. The representation learning module learns two complementary sets of representation under different constraints. These two sets compete to prevent from learning what the other has learned, thereby minimizing mutual predictability. Subsequently, these sets are combined synergistically and relayed to the rule extractor module, where discrete abstract rules are formed to predict the correct option. Empirical experiments consistently show that ICNet achieves superior results across several visual abstract reasoning datasets. Additionally, in Out-of-Distribution relationship reasoning benchmarks, ICNet demonstrates robust generalization ability.</p></div>\",\"PeriodicalId\":8449,\"journal\":{\"name\":\"Artificial Intelligence Review\",\"volume\":\"58 12\",\"pages\":\"\"},\"PeriodicalIF\":13.9000,\"publicationDate\":\"2025-10-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://link.springer.com/content/pdf/10.1007/s10462-025-11372-x.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Artificial Intelligence Review\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10462-025-11372-x\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence Review","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10462-025-11372-x","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Learning diversified representations for visual abstract reasoning
Learning effective representations suitable for decision making in high-level cognitive space is crucial for visual abstract reasoning tasks. The visual system of the mammalian brain is organized into parallel networks that can be roughly classified in dichotomy as the dorsal and ventral streams. How do parallel networks learn efficient representations for cognitive tasks is still an elusive question. We propose the Information Competition Learning Network (ICNet) within a mutual information-constrained framework to learn diversified representations for visual abstract reasoning tasks. ICNet comprises a representation learning module and a rule extractor module. The representation learning module learns two complementary sets of representation under different constraints. These two sets compete to prevent from learning what the other has learned, thereby minimizing mutual predictability. Subsequently, these sets are combined synergistically and relayed to the rule extractor module, where discrete abstract rules are formed to predict the correct option. Empirical experiments consistently show that ICNet achieves superior results across several visual abstract reasoning datasets. Additionally, in Out-of-Distribution relationship reasoning benchmarks, ICNet demonstrates robust generalization ability.
期刊介绍:
Artificial Intelligence Review, a fully open access journal, publishes cutting-edge research in artificial intelligence and cognitive science. It features critical evaluations of applications, techniques, and algorithms, providing a platform for both researchers and application developers. The journal includes refereed survey and tutorial articles, along with reviews and commentary on significant developments in the field.