Pattern Analysis and Applications最新文献

TabMixer: advancing tabular data analysis with an enhanced MLP-mixer approach. TabMixer：通过增强的MLP-mixer方法推进表格数据分析。

IF 3.7 4区计算机科学

Pattern Analysis and Applications Pub Date : 2025-06-01 Epub Date: 2025-02-21 DOI: 10.1007/s10044-025-01423-y

Ali Eslamian, Qiang Cheng

{"title":"TabMixer: advancing tabular data analysis with an enhanced MLP-mixer approach.","authors":"Ali Eslamian, Qiang Cheng","doi":"10.1007/s10044-025-01423-y","DOIUrl":"10.1007/s10044-025-01423-y","url":null,"abstract":"Tabular data, prevalent in relational databases and spreadsheets, is fundamental across fields like healthcare, engineering, and finance. Despite significant advances in tabular data learning, critical challenges remain: handling missing values, addressing class imbalance, enabling transfer learning, and facilitating feature incremental learning beyond traditional supervised paradigms. We introduce TabMixer, an innovative model that enhances the multilayer perceptron (MLP) mixer architecture to address these challenges. TabMixer incorporates a self-attention mechanism, making it versatile across various learning scenarios including supervised learning, transfer learning, and feature incremental learning. Extensive experiments on eight public datasets demonstrate TabMixer's superior performance over existing state-of-the-art methods. Notably, TabMixer achieved substantial improvements in ANOVA AUC across all scenarios: a 4% increase in supervised learning (0.840 to 0.881), 8% in transfer learning (0.803 to 0.872), and 4% in feature incremental learning (0.806 to 0.843). TabMixer demonstrates high computational efficiency and scalability through reduced floating-point operations and learnable parameters. Moreover, it exhibits strong resilience to missing values and class imbalances through both its architectural design and optional preprocessing enhancements. These results establish TabMixer as a promising model for tabular data analysis and a valuable tool for diverse applications.","PeriodicalId":54639,"journal":{"name":"Pattern Analysis and Applications","volume":"28 2","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12053537/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144060749","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Animal re-identification in video through track clustering. 基于轨迹聚类的视频动物再识别。

IF 3.7 4区计算机科学

Pattern Analysis and Applications Pub Date : 2025-01-01 Epub Date: 2025-06-19 DOI: 10.1007/s10044-025-01497-8

Francis J Williams, Samuel L Hennessey, Ludmila I Kuncheva

引用次数: 0

K-BEST subspace clustering: kernel-friendly block-diagonal embedded and similarity-preserving transformed subspace clustering K-BEST 子空间聚类：内核友好的块对角嵌入式和保全相似性的变换子空间聚类

IF 3.9 4区计算机科学

Pattern Analysis and Applications Pub Date : 2024-09-19 DOI: 10.1007/s10044-024-01336-2

Jyoti Maggu, Anurag Goel

{"title":"K-BEST subspace clustering: kernel-friendly block-diagonal embedded and similarity-preserving transformed subspace clustering","authors":"Jyoti Maggu, Anurag Goel","doi":"10.1007/s10044-024-01336-2","DOIUrl":"https://doi.org/10.1007/s10044-024-01336-2","url":null,"abstract":"Subspace clustering methods, employing sparse and low-rank models, have demonstrated efficacy in clustering high-dimensional data. These approaches typically assume the separability of input data into distinct subspaces, a premise that does not hold true in general. Furthermore, prevalent low-rank and sparse methods relying on self-expression exhibit effectiveness primarily with linear structure data, facing limitations in processing datasets with intricate nonlinear structures. While kernel subspace clustering methods excel in handling nonlinear structures, they may compromise similarity information during the reconstruction of original data in kernel space. Additionally, these methods may fall short of attaining an affinity matrix with an optimal block-diagonal property. In response to these challenges, this paper introduces a novel subspace clustering approach named Similarity Preserving Kernel Block Diagonal Representation based Transformed Subspace Clustering (KBD-TSC). KBD-TSC contributes in three key aspects: (1) integration of a kernelized version of transform learning within a subspace clustering framework, introducing a block diagonal representation term to generate an affinity matrix with a block-diagonal structure. (2) Construction and integration of a similarity preserving regularizer into the model by minimizing the discrepancy between inner products of the original data and those of the reconstructed data in kernel space. This facilitates enhanced preservation of similarity information between the original data points. (3) Proposal of KBD-TSC by integrating the block diagonal representation term and similarity preserving regularizer into a kernel self-expressing model. The optimization of the proposed model is efficiently addressed through the alternating direction method of multipliers. This study validates the effectiveness of the proposed KBD-TSC method through experimental results obtained from nine datasets, showcasing its potential in addressing the limitations of existing subspace clustering techniques.","PeriodicalId":54639,"journal":{"name":"Pattern Analysis and Applications","volume":"14 1","pages":""},"PeriodicalIF":3.9,"publicationDate":"2024-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142247451","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Hidden Markov models with multivariate bounded asymmetric student’s t-mixture model emissions 隐马尔可夫模型与多变量有界非对称学生 t 混合模型排放

IF 3.9 4区计算机科学

Pattern Analysis and Applications Pub Date : 2024-09-18 DOI: 10.1007/s10044-024-01341-5

Ons Bouarada, Muhammad Azam, Manar Amayri, Nizar Bouguila

{"title":"Hidden Markov models with multivariate bounded asymmetric student’s t-mixture model emissions","authors":"Ons Bouarada, Muhammad Azam, Manar Amayri, Nizar Bouguila","doi":"10.1007/s10044-024-01341-5","DOIUrl":"https://doi.org/10.1007/s10044-024-01341-5","url":null,"abstract":"Hidden Markov models (HMMs) are popular methods for continuous sequential data modeling and classification tasks. In such applications, the observation emission densities of the HMM hidden states are generally continuous, can vary from one model to the other, and are typically modeled by elliptically contoured distributions, namely Gaussians or Student’s t-distributions. In this context, this paper proposes a novel HMM with Bounded Asymmetric Student’s t-Mixture Model (BASMM) emissions. Our new BASMMHMM is introduced in the light of the added robustness guaranteed by the BASMM in comparison to other popular emission distributions such as the Gaussian Mixture Model (GMM). In fact, GMMs generally have a limited performance with outliers in the data sets (observations) that the HMM is fitted to. Also, GMMs cannot sufficiently model skewed populations, which are typical in many fields, such as financial or signal processing-related data sets. An excellent alternative to solve this problem is found in Student’s t-mixture models. They have similar behaviour and shape to GMMs, but with heavier tails. This allows to have more tolerance towards data sets that span extensive ranges and include outliers. Asymmetry and bounded support are also important features that can further extend the model’s flexibility and fit the imperfections of real-world data. This leads us to explore the effectiveness of the BASMM as an observation emission distribution in HMMs, hence the proposed BASMMHMM. We will also demonstrate the improved robustness of our model by presenting the results of three different experiments: occupancy estimation, stock price prediction, and human activity recognition.","PeriodicalId":54639,"journal":{"name":"Pattern Analysis and Applications","volume":"14 1","pages":""},"PeriodicalIF":3.9,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142247454","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Research on decoupled adaptive graph convolution networks based on skeleton data for action recognition 基于骨架数据的去耦合自适应图卷积网络在动作识别中的应用研究

IF 3.9 4区计算机科学

Pattern Analysis and Applications Pub Date : 2024-09-18 DOI: 10.1007/s10044-024-01319-3

Haigang Deng, Guocheng Lin, Chengwei Li, Chuanxu Wang

{"title":"Research on decoupled adaptive graph convolution networks based on skeleton data for action recognition","authors":"Haigang Deng, Guocheng Lin, Chengwei Li, Chuanxu Wang","doi":"10.1007/s10044-024-01319-3","DOIUrl":"https://doi.org/10.1007/s10044-024-01319-3","url":null,"abstract":"Graph convolutional network is apt for feature extraction in terms of non-Euclidian human skeleton data, but its adjacency matrix is fixed and the receptive field is small, which results in bias representation for skeleton intrinsic information. In addition, the operation of mean pooling on spatio-temporal features in classification layer will result in losing information and degrade recognition accuracy. To this end, the Decoupled Adaptive Graph Convolutional Network (DAGCN) is proposed. Specifically, a multi-level adaptive adjacency matrix is designed, which can dynamically obtain the rich correlation information among the skeleton nodes by a non-local adaptive algorithm. Whereafter, a new Residual Multi-scale Temporal Convolution Network (RMTCN) is proposed to fully extract temporal feature of the above decoupled skeleton dada. For the second problem in classification, we decompose the spatio-temporal features into three parts as spatial, temporal, spatio-temporal information, they are averagely pooled respectively, and added together for classification, denoted as STMP (spatio-temporal mean pooling) module. Experimental results show that our algorithm achieves accuracy of 96.5%, 90.6%, 96.4% on NTU-RGB+D60, NTU-RGB+D120 and NW-UCLA data sets respectively.","PeriodicalId":54639,"journal":{"name":"Pattern Analysis and Applications","volume":"2 1","pages":""},"PeriodicalIF":3.9,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142247453","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

YOLOv7-GCM: a detection algorithm for creek waste based on improved YOLOv7 model YOLOv7-GCM：基于改进的 YOLOv7 模型的溪流废物检测算法

IF 3.9 4区计算机科学

Pattern Analysis and Applications Pub Date : 2024-09-17 DOI: 10.1007/s10044-024-01338-0

Jianhua Qin, Honglan Zhou, Huaian Yi, Luyao Ma, Jianhan Nie, Tingting Huang

{"title":"YOLOv7-GCM: a detection algorithm for creek waste based on improved YOLOv7 model","authors":"Jianhua Qin, Honglan Zhou, Huaian Yi, Luyao Ma, Jianhan Nie, Tingting Huang","doi":"10.1007/s10044-024-01338-0","DOIUrl":"https://doi.org/10.1007/s10044-024-01338-0","url":null,"abstract":"To enhance the cleanliness of creek environments, quadruped robots can be utilized to detect for creek waste. The continuous changes in the water environment significantly reduce the accuracy of image detection when using quadruped robots for image acquisition. In order to improve the accuracy of quadruped robots in waste detection, this article proposed a detection model called YOLOv7-GCM model for creek waste. The model integrated a global attention mechanism (GAM) into the YOLOv7 model, which achieved accurate waste detection in ever-changing backgrounds and underwater conditions. A content-aware reassembly of features (CARAFE) replaced a up-sampling of the YOLOv7 model to achieve more accurate and efficient feature reconstruction. A minimum point distance intersection over union (MPDIOU) loss function replaced the CIOU loss function of the YOLOv7 model to more accurately measure the similarity between target boxes and predictive boxes. After the aforementioned improvements, the YOLOv7-GCM model was obtained. A quadruped robot to patrol the creek and collect images of creek waste. Finally, the YOLOv7-GCM model was trained on the creek waste dataset. The outcomes of the experiment show that the precision rate of the YOLOv7-GCM model has increased by 4.2% and the mean average precision (mAP@0.5) has accumulated by 2.1%. The YOLOv7-GCM model provides a new method for identifying creek waste, which may help promote efficient waste management.","PeriodicalId":54639,"journal":{"name":"Pattern Analysis and Applications","volume":"36 1","pages":""},"PeriodicalIF":3.9,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142247452","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Unveiling the unseen: novel strategies for object detection beyond known distributions 揭开看不见的面纱：已知分布之外的物体检测新策略

IF 3.9 4区计算机科学

Pattern Analysis and Applications Pub Date : 2024-09-13 DOI: 10.1007/s10044-024-01334-4

S. Devi, R. Dayana, P. Malarvezhi

{"title":"Unveiling the unseen: novel strategies for object detection beyond known distributions","authors":"S. Devi, R. Dayana, P. Malarvezhi","doi":"10.1007/s10044-024-01334-4","DOIUrl":"https://doi.org/10.1007/s10044-024-01334-4","url":null,"abstract":"In contemporary machine learning, models often struggle with data distribution variations, severely impacting their out-of-distribution (OOD) generalization and detection capabilities. Current object detection methods, relying on virtual outlier synthesis and class-conditional density estimation, struggle to effectively distinguish OOD samples. They often depend on accurate density estimation and may produce virtual outliers that lack realism, particularly in complex or dynamic environments. Furthermore, previous research has typically addressed covariate and semantic shifts independently, resulting in fragmented solutions that fail to comprehensively tackle OOD generalization. This study introduces a unified approach to enhance OOD generalization in object recognition models, addressing these critical gaps. The strategy involves employing adversarial perturbations on the ID (In-Distribution) dataset to enhance the model’s resilience to distribution shifts, thereby simulating potential real-world scenarios characterized by imperceptible variations. Additionally, the integration of Maximum Mean Discrepancy (MMD) at the object level effectively discriminates between ID and OOD samples by quantifying distributional differences. For precise OOD detection, a K-nearest neighbors (KNN) algorithm is used during inference to measure similarity between samples and their closest neighbors in the training data. Evaluations on benchmark datasets, including PASCAL VOC and BDD100K as ID, with COCO and Open Images subsets as OOD, demonstrate significant improvements in OOD generalization compared to existing methods. These discoveries underscore the framework’s potential to elevate the dependability and flexibility of object recognition systems in practical scenarios, particularly in autonomous vehicles where accurate object detection under diverse conditions is critical for safety. This research contributes to advancing OOD generalization techniques and lays the groundwork for future refinement to address evolving challenges in machine learning applications. The code can be accessed from https://github.com/DeviSPhd/(OODG_OD)","PeriodicalId":54639,"journal":{"name":"Pattern Analysis and Applications","volume":"94 1","pages":""},"PeriodicalIF":3.9,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142247456","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

LDC-PP-YOLOE: a lightweight model for detecting and counting citrus fruit LDC-PP-YOLOE：检测和计数柑橘类水果的轻量级模型

IF 3.9 4区计算机科学

Pattern Analysis and Applications Pub Date : 2024-09-13 DOI: 10.1007/s10044-024-01329-1

Yibo Lv, Shenglian Lu, Xiaoyu Liu, Jiangchuan Bao, Binghao Liu, Ming Chen, Guo Li

{"title":"LDC-PP-YOLOE: a lightweight model for detecting and counting citrus fruit","authors":"Yibo Lv, Shenglian Lu, Xiaoyu Liu, Jiangchuan Bao, Binghao Liu, Ming Chen, Guo Li","doi":"10.1007/s10044-024-01329-1","DOIUrl":"https://doi.org/10.1007/s10044-024-01329-1","url":null,"abstract":"In the citrus orchard environment, accurate counting of the fruit, and the use of lightweight detection methods are the key presteps to automate citrus picking and yield estimations. Most high-precision fruit detection models based on deep learning use complex models with devices that require high quantities of computational resources and memory. Devices with limited resources cannot meet the requirements of these models. Thus, to overcome this problem, we focus on creating a lightweight model with a convolutional neural network. In this research, we propose a lightweight citrus detection model based on the mobile device LDC-PP-YOLOE. LDC-PP-YOLOE is improved based on PP-YOLOE by using localized knowledge distillation and CBAM, with a mAP@0.5 of 88(%), mAP@0.95 of 51.3(%), params of 8 M and speed of 0.34 s, respectively. The performance of LDC-PP-YOLOE was compared against commonly used detectors and LDC-PP-YOLOE’s mAP@0.5 was 2.5, 6.9 and 16.3(%), and was 4.3(%) greater than Faster R-CNN, YOLOX-s and PicoDet-L, respectively. LDC-PP-YOLOE achieved an RMSE of 8.63 and an MSE of 5.27 compared to the ground truth on citrus applications. In addition, we used apple and passion fruit datasets to verify the generalization of the model; the mAP@0.5 is improved by 1 and 0.7(%). LDC-PP-YOLOE can be used as a lightweight model to help growers track citrus populations and optimize citrus yields in complex citrus orchard environments with resource-limited equipment. It also provides a solution for lightweight models.","PeriodicalId":54639,"journal":{"name":"Pattern Analysis and Applications","volume":"190 1","pages":""},"PeriodicalIF":3.9,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142247455","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Methods for calculating gliding-box lacunarity efficiently on large datasets 在大型数据集上高效计算滑动盒缺陷的方法

IF 3.9 4区计算机科学

Pattern Analysis and Applications Pub Date : 2024-09-13 DOI: 10.1007/s10044-024-01332-6

Bálint Barna H. Kovács, Miklós Erdélyi

引用次数: 0

DN3MF: deep neural network for non-negative matrix factorization towards low rank approximation DN3MF：面向低等级逼近的非负矩阵因式分解深度神经网络

IF 3.9 4区计算机科学

Pattern Analysis and Applications Pub Date : 2024-09-11 DOI: 10.1007/s10044-024-01335-3

Prasun Dutta, Rajat K. De

{"title":"DN3MF: deep neural network for non-negative matrix factorization towards low rank approximation","authors":"Prasun Dutta, Rajat K. De","doi":"10.1007/s10044-024-01335-3","DOIUrl":"https://doi.org/10.1007/s10044-024-01335-3","url":null,"abstract":"Dimension reduction is one of the most sought-after methodologies to deal with high-dimensional ever-expanding complex datasets. Non-negative matrix factorization (NMF) is one such technique for dimension reduction. Here, a multiple deconstruction multiple reconstruction deep learning model (DN3MF) for NMF targeted towards low rank approximation, has been developed. Non-negative input data has been processed using hierarchical learning to generate part-based sparse and meaningful representation. The novel design of DN3MF ensures the non-negativity requirement of the model. The use of Xavier initialization technique solves the exploding or vanishing gradient problem. The objective function of the model has been designed employing regularization, ensuring the best possible approximation of the input matrix. A novel adaptive learning mechanism has been developed to accomplish the objective of the model. The superior performance of the proposed model has been established by comparing the results obtained by the model with that of six other well-established dimension reduction algorithms on three well-known datasets in terms of preservation of the local structure of data in low rank embedding, and in the context of downstream analyses using classification and clustering. The statistical significance of the results has also been established. The outcome clearly demonstrates DN3MF’s superiority over compared dimension reduction approaches in terms of both statistical and intrinsic property preservation standards. The comparative analysis of all seven dimensionality reduction algorithms including DN3MF with respect to the computational complexity and a pictorial depiction of the convergence analysis for both stages of DN3MF have also been presented.","PeriodicalId":54639,"journal":{"name":"Pattern Analysis and Applications","volume":"6 1","pages":""},"PeriodicalIF":3.9,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142179467","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0