{"title":"面向社会化多媒体知识发现的分层信息压缩方法","authors":"Zheng Liu;Yu Weng;Ruiyang Xu;Chaomurilige;Honghao Gao","doi":"10.1109/TCSS.2024.3440997","DOIUrl":null,"url":null,"abstract":"Knowledge discovery is an ongoing research endeavor aimed at uncovering valuable insights and patterns from large volumes of data in massive social systems (MSSs). Although recent advances in deep learning have made significant progress in knowledge discovery, the “data dimensionality reduction” problem still poses practical challenges. To address this, we have introduced a hierarchical information compression (IC) approach, which emphasizes the elimination of redundant and irrelevant features and the generation of high-quality knowledge representation, aiming to enhance the information density of the knowledge discovery process. Our approach consists of coarse-grained and fine-grained stages for data compression. In the coarse-grained stage, our method employs the key feature distiller based on the Siamese network to effectively identify a substantial number of irrelevant features and latent redundancies within coarse-grained data blocks. Moving on to the fine-grained stage, our model further compresses the internal features of the data, extracting the most crucial knowledge and facilitating data compression by cross-block learning. By implementing these two stages, the approach achieves both inter and innerblock IC while preserving essential knowledge. To validate the performance of our proposed model, we conducted several experiments using WikiSum, a large knowledge corpus based on English Wikipedia in MSSs. The experimental results demonstrate that our model achieved a 2.38% increase on recall-oriented understudy for gisting evaluation (ROUGE)-2 and an improvement of over 7% on the informativeness and conciseness metrics, as evidenced by the improved scores obtained from both automatic and human evaluations. The experimental results prove that our model can effectively select the most pertinent and meaningful content and reduce the redundancy to generate better knowledge representation.","PeriodicalId":13044,"journal":{"name":"IEEE Transactions on Computational Social Systems","volume":"11 6","pages":"7754-7765"},"PeriodicalIF":4.5000,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Hierarchical Information Compression Approach for Knowledge Discovery From Social Multimedia\",\"authors\":\"Zheng Liu;Yu Weng;Ruiyang Xu;Chaomurilige;Honghao Gao\",\"doi\":\"10.1109/TCSS.2024.3440997\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Knowledge discovery is an ongoing research endeavor aimed at uncovering valuable insights and patterns from large volumes of data in massive social systems (MSSs). Although recent advances in deep learning have made significant progress in knowledge discovery, the “data dimensionality reduction” problem still poses practical challenges. To address this, we have introduced a hierarchical information compression (IC) approach, which emphasizes the elimination of redundant and irrelevant features and the generation of high-quality knowledge representation, aiming to enhance the information density of the knowledge discovery process. Our approach consists of coarse-grained and fine-grained stages for data compression. In the coarse-grained stage, our method employs the key feature distiller based on the Siamese network to effectively identify a substantial number of irrelevant features and latent redundancies within coarse-grained data blocks. Moving on to the fine-grained stage, our model further compresses the internal features of the data, extracting the most crucial knowledge and facilitating data compression by cross-block learning. By implementing these two stages, the approach achieves both inter and innerblock IC while preserving essential knowledge. To validate the performance of our proposed model, we conducted several experiments using WikiSum, a large knowledge corpus based on English Wikipedia in MSSs. The experimental results demonstrate that our model achieved a 2.38% increase on recall-oriented understudy for gisting evaluation (ROUGE)-2 and an improvement of over 7% on the informativeness and conciseness metrics, as evidenced by the improved scores obtained from both automatic and human evaluations. The experimental results prove that our model can effectively select the most pertinent and meaningful content and reduce the redundancy to generate better knowledge representation.\",\"PeriodicalId\":13044,\"journal\":{\"name\":\"IEEE Transactions on Computational Social Systems\",\"volume\":\"11 6\",\"pages\":\"7754-7765\"},\"PeriodicalIF\":4.5000,\"publicationDate\":\"2024-09-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Computational Social Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10670458/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, CYBERNETICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Computational Social Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10670458/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, CYBERNETICS","Score":null,"Total":0}
A Hierarchical Information Compression Approach for Knowledge Discovery From Social Multimedia
Knowledge discovery is an ongoing research endeavor aimed at uncovering valuable insights and patterns from large volumes of data in massive social systems (MSSs). Although recent advances in deep learning have made significant progress in knowledge discovery, the “data dimensionality reduction” problem still poses practical challenges. To address this, we have introduced a hierarchical information compression (IC) approach, which emphasizes the elimination of redundant and irrelevant features and the generation of high-quality knowledge representation, aiming to enhance the information density of the knowledge discovery process. Our approach consists of coarse-grained and fine-grained stages for data compression. In the coarse-grained stage, our method employs the key feature distiller based on the Siamese network to effectively identify a substantial number of irrelevant features and latent redundancies within coarse-grained data blocks. Moving on to the fine-grained stage, our model further compresses the internal features of the data, extracting the most crucial knowledge and facilitating data compression by cross-block learning. By implementing these two stages, the approach achieves both inter and innerblock IC while preserving essential knowledge. To validate the performance of our proposed model, we conducted several experiments using WikiSum, a large knowledge corpus based on English Wikipedia in MSSs. The experimental results demonstrate that our model achieved a 2.38% increase on recall-oriented understudy for gisting evaluation (ROUGE)-2 and an improvement of over 7% on the informativeness and conciseness metrics, as evidenced by the improved scores obtained from both automatic and human evaluations. The experimental results prove that our model can effectively select the most pertinent and meaningful content and reduce the redundancy to generate better knowledge representation.
期刊介绍:
IEEE Transactions on Computational Social Systems focuses on such topics as modeling, simulation, analysis and understanding of social systems from the quantitative and/or computational perspective. "Systems" include man-man, man-machine and machine-machine organizations and adversarial situations as well as social media structures and their dynamics. More specifically, the proposed transactions publishes articles on modeling the dynamics of social systems, methodologies for incorporating and representing socio-cultural and behavioral aspects in computational modeling, analysis of social system behavior and structure, and paradigms for social systems modeling and simulation. The journal also features articles on social network dynamics, social intelligence and cognition, social systems design and architectures, socio-cultural modeling and representation, and computational behavior modeling, and their applications.