{"title":"Category-guided multi-interest collaborative metric learning with representation uniformity constraints","authors":"Long Wang, Tao Lian","doi":"10.1016/j.ipm.2024.103937","DOIUrl":"10.1016/j.ipm.2024.103937","url":null,"abstract":"<div><div>Multi-interest collaborative metric learning has recently emerged as an effective approach to modeling the multifaceted interests of a user in recommender systems. However, two issues remain unexplored. (1) There is no explicit guidance for the matching of an item against multiple interest vectors of a user. (2) The desired property of item representations with respect to their categories is overlooked, resulting in that different categories of items are mixed up in the latent space. To overcome these issues, we devise a Category-guided Multi-interest Collaborative Metric Learning model (CMCML) with representation uniformity constraints. CMCML is designed as a novel category-guided Mixture-of-Experts (MoE) architecture, where the gating network leverages the item category to guide the matching of an item against multiple interest vectors of a user, encouraging items with the same category to approach the same interest vector. In addition, we design a user multi-interest uniformity loss and a category-aware item uniformity loss: The former aims to avoid representation degeneration by enlarging the difference among multiple interest vectors of the same user; the latter is tailored to push different categories of items apart in the latent space. Quantitative experiments on Ciao, Epinions and TaFeng demonstrate that our CMCML improves the value of NDCG@20 by 12.41%, 10.89% and 10.39% respectively, compared to other state-of-the-art collaborative metric learning methods. Further qualitative analyses reveal that our CMCML yields a better representation space where items from distinct categories are arranged in different regions with high density.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 1","pages":"Article 103937"},"PeriodicalIF":7.4,"publicationDate":"2024-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142536067","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Cuifang Zou, Guangquan Lu, Longqing Du, Xuxia Zeng, Shilong Lin
{"title":"Graph similarity learning for cross-level interactions","authors":"Cuifang Zou, Guangquan Lu, Longqing Du, Xuxia Zeng, Shilong Lin","doi":"10.1016/j.ipm.2024.103932","DOIUrl":"10.1016/j.ipm.2024.103932","url":null,"abstract":"<div><div>Graph similarity computation is crucial in fields such as bioinformatics, e.g., identifying compounds with similar biological activities by comparing molecular structural similarities. Traditional methods such as graph edit distance (GED) and maximal common subgraphs suffer from high computational complexity and sensitivity to noise, which limit their practical applications. Existing deep learning methods make it difficult to extract graph features, which affects computational accuracy comprehensively. To address these problems, we propose a new method, CLSim, which improves performance by enhancing feature extraction and improving graph similarity computation. Using the attention mechanism, CLSim first aligns graph pair features to the shared space and aggregates node features into global embeddings. The directionality of the embedding vectors is considered when extracting graph-level features to handle more complex data. In addition, we develop cross-layer feature extraction techniques that combine node-level information with graph-level embeddings to capture detailed node-graph interaction details. Experimental results on three datasets show that CLSim has excellent generalization capabilities and achieves lower error rates compared to the GED approach and the graph neural network baseline. In the worst case, its time complexity remains quadratic. Example query results further validate the effectiveness of the model, providing a more efficient and accurate solutions for graph similarity tasks.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 1","pages":"Article 103932"},"PeriodicalIF":7.4,"publicationDate":"2024-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142536065","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Haijun He, Bobo Li, Yiyun Xiong, Li Zheng, Kang He, Fei Li, Donghong Ji
{"title":"Heuristic personality recognition based on fusing multiple conversations and utterance-level affection","authors":"Haijun He, Bobo Li, Yiyun Xiong, Li Zheng, Kang He, Fei Li, Donghong Ji","doi":"10.1016/j.ipm.2024.103931","DOIUrl":"10.1016/j.ipm.2024.103931","url":null,"abstract":"<div><div><strong>P</strong>ersonality <strong>R</strong>ecognition in <strong>C</strong>onversations (<strong>PRC</strong>) is a task of significant interest and practical value. Existing studies on the PRC task utilize conversation inadequately and neglect affective information. Considering the way of information processing of these studies is not yet close enough to the concept of personality, we propose the SAH-GCN model for the PRC task in this study. This model initially processes the original conversation input to extract the central speaker feature. Leveraging Contrastive Learning, it continuously adjusts the embedding of each utterance by incorporating affective information to cope with the semantic similarity. Subsequently, the model employs Graph Convolutional Networks to simulate the conversation dynamics, ensuring comprehensive interaction between the central speaker feature and other relevant features. Lastly, it heuristically fuses central speaker features from multiple conversations involving the same speaker into one comprehensive feature, facilitating personality recognition. We conduct experiments using the recently released CPED dataset, which is the personality dataset encompassing affection labels and conversation details. Our results demonstrate that SAH-GCN achieves superior accuracy (+1.88%) compared to prior works on the PRC task. Further analysis verifies the efficacy of our scheme that fuses multiple conversations and incorporates affective information for personality recognition.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 1","pages":"Article 103931"},"PeriodicalIF":7.4,"publicationDate":"2024-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142536066","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"LacGCL: Lightweight message masking with linear attention and cross-view interaction graph contrastive learning for recommendation","authors":"Haohe Jia , Peng Hou , Yong Zhou , Hongbin Zhu , Hongfeng Chai","doi":"10.1016/j.ipm.2024.103930","DOIUrl":"10.1016/j.ipm.2024.103930","url":null,"abstract":"<div><div>Graph contrastive learning (GCL) has recently attracted significant attention in the field of recommender systems. However, many GCL methods aim to enhance recommendation accuracy by employing dense matrix operations and frequent manipulation of graph structures to generate contrast views, leading to substantial computational resource consumption. While simpler GCL methods have lower computational costs, they fail to fully exploit collaborative filtering information, leading to reduced accuracy. On the other hand, more complex adaptive methods achieve higher accuracy but at the expense of significantly greater computational cost. Consequently, there exists a considerable gap in accuracy between these lightweight models and the more complex GCL methods focused on high accuracy.</div><div>To address this issue and achieve high predictive accuracy while maintaining low computational cost, we propose a novel method that incorporates attention-wise graph reconstruction with message masking and cross-view interaction for contrastive learning. The attention-wise graph reconstruction with message masking preserves the structural and semantic information of the graph while mitigating the overfitting problem. Linear attention ensures that the algorithm’s complexity remains low. Furthermore, the cross-view interaction is capable of capturing more high-quality latent features. Our results, validated on four datasets, demonstrate that the proposed method maintains a lightweight computational cost and significantly outperforms the baseline methods in recommendation accuracy.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 1","pages":"Article 103930"},"PeriodicalIF":7.4,"publicationDate":"2024-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142536247","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Heng-yang Lu , Tian-ci Liu , Rui Cong , Jun Yang , Qiang Gan , Wei Fang , Xiao-jun Wu
{"title":"QAIE: LLM-based Quantity Augmentation and Information Enhancement for few-shot Aspect-Based Sentiment Analysis","authors":"Heng-yang Lu , Tian-ci Liu , Rui Cong , Jun Yang , Qiang Gan , Wei Fang , Xiao-jun Wu","doi":"10.1016/j.ipm.2024.103917","DOIUrl":"10.1016/j.ipm.2024.103917","url":null,"abstract":"<div><div>Aspect-based Sentiment Analysis (ABSA) aims to extract fine-grained sentiment information from online reviews. Few-shot ABSA faces challenges with limited labeled data and recent generative models have outperformed traditional classification models. Existing methods use Question Answering (QA) templates with Text-to-Text Transfer Transformer (T5) to extract sentiment elements, introducing a generative sentiment analysis paradigm. However, these models often fail to fully grasp ABSA rules, generating non-standard or incorrect outputs. This issue also arises with large language models (LLMs) due to insufficient labeled data for tuning and learning. Additionally, ABSA datasets often include many short, uninformative reviews, complicating sentiment element extraction in few-shot scenarios. This paper addresses two major challenges in few-shot ABSA: (1) <em>How to let the generative model well understand the ABSA rules under few-shot scenarios</em>. (2) <em>How to enhance the review text with richer information</em>. We propose a <strong>Q</strong>uantity <strong>A</strong>ugmentation and <strong>I</strong>nformation <strong>E</strong>nhancement (<strong>QAIE</strong>) approach, leveraging LLMs to generate fluent texts and infer implicit information. First, we propose a quantity augmentation module, which leverages the large language model (LLM) to obtain sufficient labeled data for the generative model to learn the ABSA rules better. Then, we introduce an information enhancement module, which brings more informative input to the generative model by enhancing the information in the review. Comprehensive experiments on five ABSA tasks using three widely-used datasets demonstrate that our QAIE model achieves approximately 10% improvement over state-of-the-art models. Specifically, for the most challenging ASQP task, our LLM-based model is compared with the existing state-of-the-art models on datasets Rest15 and Rest16, achieving F1 gains of 9.42% and 6.45% respectively in the <span><math><mrow><mi>k</mi><mo>=</mo><mn>5</mn></mrow></math></span> few-shot setting.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 1","pages":"Article 103917"},"PeriodicalIF":7.4,"publicationDate":"2024-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142536246","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A hybrid feature fusion deep learning framework for multi-source medical image analysis","authors":"Qiang Cao , Xian Cheng","doi":"10.1016/j.ipm.2024.103934","DOIUrl":"10.1016/j.ipm.2024.103934","url":null,"abstract":"<div><div>Despite the widespread adoption of deep learning to enhance image classification, significant obstacles remain. First, multisource data with diverse sizes and formats is a great challenge for most current deep learning models. Second, lacking manual labeled data for model training limits the application of deep learning. Third, the widely used CNN-based methods shows their limitations in extracting global features and yield poor performance for image topology. To address these issues, we propose a Hybrid Feature Fusion Deep Learning (HFFDL) framework for image classification. This framework consists of an automated image segmentation module, a two-stream backbone module, and a classification module. The automatic image segmentation module utilizes the U-Net model and transfer learning to detect region of interest (ROI) in multisource images; the two-stream backbone module integrates the Swin Transformer architecture with the Inception CNN, with the aim of simultaneous extracting local and global features for efficient representation learning. We evaluate the performance of HFFDL framework with two publicly available image datasets: one for identifying COVID-19 through X-ray scans of the chest (30,386 images), and another for multiclass skin cancer screening using dermoscopy images (25,331 images). The HFFDL framework exhibited greater performance in comparison to many cutting-edge models, achieving the AUC score 0.9835 and 0.8789, respectively. Furthermore, a practical application study conducted in a hospital, identifying viable embryos using medical images, revealed the HFFDL framework outperformed embryologists.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 1","pages":"Article 103934"},"PeriodicalIF":7.4,"publicationDate":"2024-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142535923","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jinjin Zhang, Qimeng Fan, Dingan Wang, Pu Huang, Zhangjing Yang
{"title":"Triple Sparse Denoising Discriminantive Least Squares Regression for image classification","authors":"Jinjin Zhang, Qimeng Fan, Dingan Wang, Pu Huang, Zhangjing Yang","doi":"10.1016/j.ipm.2024.103922","DOIUrl":"10.1016/j.ipm.2024.103922","url":null,"abstract":"<div><div>Discriminantive Least Squares Regression (DLSR) is an algorithm that employs <span><math><mi>ɛ</mi></math></span>-draggings techniques to enhance intra-class similarity. However, it overlooks that an increase in intra-class closeness may simultaneously lead to a decrease in the distance between similar but different classes. To address this issue, we propose a new approach called Triple Sparse Denoising Discriminantive Least Squares Regression (TSDDLSR), which combines three sparsity constraints: sparsity constraints between classes to amplify the growth of the distance between similar classes; sparsity constraints on relaxation matrices to capture more local structure; sparsity constraints on noise matrices to minimize the effect of outliers. In addition, we position the matrix decomposition step in the label space strategically with the objective of enhancing denoising capabilities, safeguarding it from potential degradation, and preserving its underlying manifold structure. Our experiments evaluate the classification performance of the method under face recognition tasks (AR, CMU PIE, Extended Yale B, Georgia Tech, FERET datasets), biometric recognition tasks (PolyU Palmprint dataset), and object recognition tasks (COIL-20, ImageNet datasets). Meanwhile, the results show that TSDDLSR significantly improves classification performance compared to existing methods.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 1","pages":"Article 103922"},"PeriodicalIF":7.4,"publicationDate":"2024-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142536026","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Lin Runhui , Li Yalin , Ji Ze , Xie Qiqi , Chen Xiaoyu
{"title":"Quantifying the degree of scientific innovation breakthrough: Considering knowledge trajectory change and impact","authors":"Lin Runhui , Li Yalin , Ji Ze , Xie Qiqi , Chen Xiaoyu","doi":"10.1016/j.ipm.2024.103933","DOIUrl":"10.1016/j.ipm.2024.103933","url":null,"abstract":"<div><div>Scientific breakthroughs have the potential to reshape the trajectory of knowledge flow and significantly impact later research. The aim of this study is to introduce the Degree of Innovation Breakthrough (DIB) metric to more accurately quantify the extent of scientific breakthroughs. The DIB metric takes into account changes in the trajectory of knowledge flow, as well as the deep and width of impact, and it modifies the traditional assumption of equal citation contributions by assigning weighted citation counts. The effectiveness of the DIB metric is assessed using ROC curves and AUC metrics, demonstrating its ability to differentiate between high and low scientific breakthroughs with high sensitivity and minimal false positives. Based on ROC curves, this study proposes a method to calculate the threshold for high scientific breakthrough, reducing subjectivity. The effectiveness of the proposed method is demonstrated through a dataset consisting of 1108 award-winning computer science papers and 9832 matched control papers, showing that the DIB metric surpasses single-dimensional metrics. The study also performs a granular analysis of the innovation breakthrough degree of non-award-winning papers, categorizing them into four types based on originality and impact through 2D histogram visualization, and suggests tailored management strategies. Through the adoption of this refined classification strategy, the management of innovation practices can be optimized, ultimately fostering the enhancement of innovative research outcomes. The quantitative tools introduced in this paper offer guidance for researchers in the fields of science intelligence mining and science trend prediction.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 1","pages":"Article 103933"},"PeriodicalIF":7.4,"publicationDate":"2024-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142536025","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tao Wen , Yu-wang Chen , Tahir Abbas Syed , Darminder Ghataoura
{"title":"Examining communication network behaviors, structure and dynamics in an organizational hierarchy: A social network analysis approach","authors":"Tao Wen , Yu-wang Chen , Tahir Abbas Syed , Darminder Ghataoura","doi":"10.1016/j.ipm.2024.103927","DOIUrl":"10.1016/j.ipm.2024.103927","url":null,"abstract":"<div><div>Effectively understanding and enhancing communication flows among employees within an organizational hierarchy is crucial for optimizing operational and decision-making efficiency. To fill this significant gap in research, we propose a systematic and comprehensive social network analysis approach, coupled with a newly formulated communication vector and matrix, to examine communication behaviors and dynamics in an organizational hierarchy. We use the Enron email dataset, consisting of 619,499 emails, as an illustrative example to bridge the micro-macro divide of organizational communication research. A series of centrality measures are employed to evaluate the influential ability of individual employees, revealing descending influential ability and changing behaviors according to hierarchy. We also uncover that employees tend to communicate within the same functional teams through the identification of community structure and the proposed communication matrix. Furthermore, the emergent dynamics of organizational communication during a crisis are examined through a time-segmented dataset, showcasing the progressive absence of the legal team, the responsibility of top management, and the presence of hierarchy. By considering both individual and organizational perspectives, our work provides a systematic and data-driven approach to understanding how the organizational communication network emerges dynamically from individual communication behaviors within the hierarchy, which has the potential to enhance operational and decision-making efficiency within organizations.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 1","pages":"Article 103927"},"PeriodicalIF":7.4,"publicationDate":"2024-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142535922","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Unsupervised feature selection using sparse manifold learning: Auto-encoder approach","authors":"Amir Moslemi , Mina Jamshidi","doi":"10.1016/j.ipm.2024.103923","DOIUrl":"10.1016/j.ipm.2024.103923","url":null,"abstract":"<div><div>Feature selection techniques are widely being used as a preprocessing step to train machine learning algorithms to circumvent the curse of dimensionality, overfitting, and computation time challenges. Projection-based methods are frequently employed in feature selection, leveraging the extraction of linear relationships among features. The absence of nonlinear information extraction among features is notable in this context. While auto-encoder based techniques have recently gained traction for feature selection, their focus remains primarily on the encoding phase, as it is through this phase that the selected features are derived. The subtle point is that the performance of auto-encoder to obtain the most discriminative features is significantly affected by decoding phase. To address these challenges, in this paper, we proposed a novel feature selection based on auto-encoder to not only extracting nonlinear information among features but also decoding phase is regularized as well to enhance the performance of algorithm. In this study, we defined a new model of auto-encoder to preserve the topological information of reconstructed close to input data. To geometric structure of input data is preserved in projected space using Laplacian graph, and geometrical projected space is preserved in reconstructed space using a suitable term (abstract Laplacian graph of reconstructed data) in optimization problem. Preserving abstract Laplacian graph of reconstructed data close to Laplacian graph of input data affects the performance of feature selection and we experimentally showed this. Therefore, we show an effective approach to solve the objective of the corresponding problem. Since this approach can be mainly used for clustering aims, we conducted experiments on ten benchmark datasets and assessed our propped method based on clustering accuracy and normalized mutual information (NMI) metric. Our method obtained considerable superiority over recent state-of-the-art techniques in terms of NMI and accuracy.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 1","pages":"Article 103923"},"PeriodicalIF":7.4,"publicationDate":"2024-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142446437","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}