International Journal of Machine Learning and Cybernetics最新文献_第8页

Label distribution learning via second-order self-representation 通过二阶自表示进行标签分布学习

IF 5.6 3区计算机科学

International Journal of Machine Learning and Cybernetics Pub Date : 2024-08-11 DOI: 10.1007/s13042-024-02295-0

Peiqiu Yu, Lei Chen, Weiwei Li, Xiuyi Jia

{"title":"Label distribution learning via second-order self-representation","authors":"Peiqiu Yu, Lei Chen, Weiwei Li, Xiuyi Jia","doi":"10.1007/s13042-024-02295-0","DOIUrl":"https://doi.org/10.1007/s13042-024-02295-0","url":null,"abstract":"Label distribution learning is an effective learning approach for addressing label polysemy in the field of machine learning. In contrast to multi-label learning, label distribution learning can accurately represent the relative importance of labels and has richer semantic information about labels. Presently label distribution learning algorithms frequently integrate label correlation into their models to narrow down the assumption space of the model. However, existing label distribution learning works on label correlation use one-to-one or many-to-one correlation which has limitations in representing more complex correlation relationships. To address this issue, we attempt to extend the existing correlation relationships to many-to-many relationships. Specifically, we first construct a many-to-many correlation mining framework based on self-representation. Then by using the learned many-to-many correlation, a label distribution learning algorithm is designed. Our algorithm achieved the best performance in (78.21%) of cases across all datasets and all performance metrics with the algorithm having the best average ranking. It also demonstrated statistical superiority compared to the comparison algorithms in pairwise two-tailed t-tests. This paper introduces a novel approach to representing and applying label correlations in label distribution learning. The exploitation of this new many-to-many correlation can enhance the representational capabilities of label distribution learning models.","PeriodicalId":51327,"journal":{"name":"International Journal of Machine Learning and Cybernetics","volume":"311 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2024-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141938952","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A generalized tri-factorization method for accurate matrix completion 用于精确矩阵补全的广义三因式分解法

IF 5.6 3区计算机科学

International Journal of Machine Learning and Cybernetics Pub Date : 2024-08-06 DOI: 10.1007/s13042-024-02289-y

Qing Liu, Hao Wu, Yu Zong, Zheng-Yu Liu

引用次数: 0

Multiscale-integrated deep learning approaches for short-term load forecasting 用于短期负荷预测的多矢量集成深度学习方法

IF 5.6 3区计算机科学

International Journal of Machine Learning and Cybernetics Pub Date : 2024-08-06 DOI: 10.1007/s13042-024-02302-4

Yang Yang, Yuchao Gao, Zijin Wang, Xi’an Li, Hu Zhou, Jinran Wu

引用次数: 0

Multi-graph aggregated graph neural network for heterogeneous graph representation learning 用于异构图表示学习的多图聚合图神经网络

IF 5.6 3区计算机科学

International Journal of Machine Learning and Cybernetics Pub Date : 2024-08-05 DOI: 10.1007/s13042-024-02294-1

Shuailei Zhu, Xiaofeng Wang, Shuaiming Lai, Yuntao Chen, Wenchao Zhai, Daying Quan, Yuanyuan Qi, Laishui Lv

{"title":"Multi-graph aggregated graph neural network for heterogeneous graph representation learning","authors":"Shuailei Zhu, Xiaofeng Wang, Shuaiming Lai, Yuntao Chen, Wenchao Zhai, Daying Quan, Yuanyuan Qi, Laishui Lv","doi":"10.1007/s13042-024-02294-1","DOIUrl":"https://doi.org/10.1007/s13042-024-02294-1","url":null,"abstract":"Heterogeneous graph neural networks have attracted considerable attention for their proficiency in handling intricate heterogeneous structures. However, most existing methods model semantic relationships in heterogeneous graphs by manually defining meta-paths, inadvertently overlooking the inherent incompleteness of such graphs. To address this issue, we propose a multi-graph aggregated graph neural network (MGAGNN) for heterogeneous graph representation learning, which simultaneously leverages attribute similarity and high-order semantic information between nodes. Specifically, MGAGNN first employs the feature graph generator to generate a feature graph for completing the original graph structure. A semantic graph is then generated using a semantic graph generator, capturing higher-order semantic information through automatic meta-path learning. Finally, we aggregate the two candidate graphs to reconstruct a new heterogeneous graph and learn node embedding by graph convolutional networks. Extensive experiments on real-world datasets demonstrate the superior performance of the proposed method over state-of-the-art approaches.","PeriodicalId":51327,"journal":{"name":"International Journal of Machine Learning and Cybernetics","volume":"11 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141938953","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Mobile robot path planning based on multi-experience pool deep deterministic policy gradient in unknown environment 未知环境下基于多经验池深度确定性策略梯度的移动机器人路径规划

IF 5.6 3区计算机科学

International Journal of Machine Learning and Cybernetics Pub Date : 2024-08-04 DOI: 10.1007/s13042-024-02281-6

Linxin Wei, Quanxing Xu, Ziyu Hu

{"title":"Mobile robot path planning based on multi-experience pool deep deterministic policy gradient in unknown environment","authors":"Linxin Wei, Quanxing Xu, Ziyu Hu","doi":"10.1007/s13042-024-02281-6","DOIUrl":"https://doi.org/10.1007/s13042-024-02281-6","url":null,"abstract":"The path planning for unmanned mobile robots has always been a crucial issue, especially in unknown environments. Reinforcement learning widely used in path planning due to its ability to learn from unknown environments. But, in unknown environments, deep reinforcement learning algorithms have problems such as long training time and instability. In this article, improvements have been made to the deep deterministic policy gradient algorithm (DDPG) to address the aforementioned issues. Firstly, the experience pool is divided into different experience pools based on the difference between adjacent states; Secondly, experience is collected from various experience pools in different proportions for training, enabling the robot to achieve good obstacle avoidance ability; Finally, by designing a guided reward function, the convergence speed of the algorithm has been improved, and the robot can find the target point faster. The algorithm has been tested in practice and simulation, and the results show that it can enable robots to complete path planning tasks in complex unknown environments.","PeriodicalId":51327,"journal":{"name":"International Journal of Machine Learning and Cybernetics","volume":"7 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2024-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141939012","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Doublem-net: multi-scale spatial pyramid pooling-fast and multi-path adaptive feature pyramid network for UAV detection Doublem-net：用于无人机探测的多尺度空间金字塔集合--快速和多路径自适应特征金字塔网络

IF 5.6 3区计算机科学

International Journal of Machine Learning and Cybernetics Pub Date : 2024-07-26 DOI: 10.1007/s13042-024-02278-1

Zhongxu Li, Qihan He, Hong Zhao, Wenyuan Yang

{"title":"Doublem-net: multi-scale spatial pyramid pooling-fast and multi-path adaptive feature pyramid network for UAV detection","authors":"Zhongxu Li, Qihan He, Hong Zhao, Wenyuan Yang","doi":"10.1007/s13042-024-02278-1","DOIUrl":"https://doi.org/10.1007/s13042-024-02278-1","url":null,"abstract":"Unmanned aerial vehicles (UAVs) are extensively applied in military, rescue operations, and traffic detection fields, resulting from their flexibility, low cost, and autonomous flight capabilities. However, due to the drone’s flight height and shooting angle, the objects in aerial images are smaller, denser, and more complex than those in general images, triggering an unsatisfactory target detection effect. In this paper, we propose a model for UAV detection called DoubleM-Net, which contains multi-scale spatial pyramid pooling-fast (MS-SPPF) and Multi-Path Adaptive Feature Pyramid Network (MPA-FPN). DoubleM-Net utilizes the MS-SPPF module to extract feature maps of multiple receptive field sizes. Then, the MPA-FPN module first fuses features from every two adjacent scales, followed by a level-by-level interactive fusion of features. First, using the backbone network as the feature extractor, multiple feature maps of different scale ranges are extracted from the input image. Second, the MS-SPPF uses different pooled kernels to repeat multiple pooled operations at various scales to achieve rich multi-perceptive field features. Finally, the MPA-FPN module first incorporates semantic information between each adjacent two-scale layer. The top-level features are then passed back to the bottom level-by-level, and the underlying features are enhanced, enabling interaction and integration of features at different scales. The experimental results show that the mAP50-95 ratio of DoubleM-Net on the VisDrone dataset is 27.5%, and that of Doublem-Net on the DroneVehicle dataset in RGB and Infrared mode is 55.0% and 60.4%, respectively. Our model demonstrates excellent performance in air-to-ground image detection tasks, with exceptional results in detecting small objects.","PeriodicalId":51327,"journal":{"name":"International Journal of Machine Learning and Cybernetics","volume":"16 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141770209","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Bgman: Boundary-Prior-Guided Multi-scale Aggregation Network for skin lesion segmentation Bgman：用于皮损分割的边界先导多尺度聚合网络

IF 5.6 3区计算机科学

International Journal of Machine Learning and Cybernetics Pub Date : 2024-07-26 DOI: 10.1007/s13042-024-02284-3

Zhenyang Huang, Yixing Zhao, Jinjiang Li, Yepeng Liu

{"title":"Bgman: Boundary-Prior-Guided Multi-scale Aggregation Network for skin lesion segmentation","authors":"Zhenyang Huang, Yixing Zhao, Jinjiang Li, Yepeng Liu","doi":"10.1007/s13042-024-02284-3","DOIUrl":"https://doi.org/10.1007/s13042-024-02284-3","url":null,"abstract":"Skin lesion segmentation is a fundamental task in the field of medical image analysis. Deep learning approaches have become essential tools for segmenting medical images, as their accuracy in effectively analyzing abnormalities plays a critical role in determining the ultimate diagnostic results. Because of the inherent difficulties presented by medical images, including variations in shapes and sizes, along with the indistinct boundaries between lesions and the surrounding backgrounds, certain conventional algorithms face difficulties in fulfilling the growing requirements for elevated accuracy in processing medical images. To enhance the performance in capturing edge features and fine details of lesion processing, this paper presents the Boundary-Prior-Guided Multi-Scale Aggregation Network for skin lesion segmentation (BGMAN). The proposed BGMAN follows a basic Encoder–Decoder structure, wherein the encoder network employs prevalent CNN-based architectures to capture semantic information. We propose the Transformer Bridge Block (TBB) and employ it to enhance multi-scale features captured by the encoder. The TBB strengthens the intensity of weak feature information, establishing long-distance relationships between feature information. In order to augment BGMAN’s capability to identify boundaries, a boundary-guided decoder is designed, utilizing the Boundary Aware Block (BAB) and Cross Scale Fusion Block (CSFB) to guide the decoding learning process. BAB can acquire features embedded with explicit boundary information under the supervision of a boundary mask, while CSFB aggregates boundary features from different scales using learnable embeddings. The proposed method has been validated on the ISIC2016, ISIC2017, and (PH^2) datasets. It outperforms current mainstream networks with the following results: F1 92.99 and IoU 87.71 on ISIC2016, F1 86.42 and IoU 78.34 on ISIC2017, and F1 94.83 and IoU 90.26 on (PH^2).","PeriodicalId":51327,"journal":{"name":"International Journal of Machine Learning and Cybernetics","volume":"43 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141784936","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Quasi-framelets: robust graph neural networks via adaptive framelet convolution 准小帧：通过自适应小帧卷积实现鲁棒图神经网络

IF 5.6 3区计算机科学

International Journal of Machine Learning and Cybernetics Pub Date : 2024-07-26 DOI: 10.1007/s13042-024-02286-1

Mengxi Yang, Dai Shi, Xuebin Zheng, Jie Yin, Junbin Gao

{"title":"Quasi-framelets: robust graph neural networks via adaptive framelet convolution","authors":"Mengxi Yang, Dai Shi, Xuebin Zheng, Jie Yin, Junbin Gao","doi":"10.1007/s13042-024-02286-1","DOIUrl":"https://doi.org/10.1007/s13042-024-02286-1","url":null,"abstract":"This paper aims to provide a novel design of a multiscale framelet convolution for spectral graph neural networks (GNNs). While current spectral methods excel in various graph learning tasks, they often lack the flexibility to adapt to noisy, incomplete, or perturbed graph signals, making them fragile in such conditions. Our newly proposed framelet convolution addresses these limitations by decomposing graph data into low-pass and high-pass spectra through a finely-tuned multiscale approach. Our approach directly designs filtering functions within the spectral domain, allowing for precise control over the spectral components. The proposed design excels in filtering out unwanted spectral information and significantly reduces the adverse effects of noisy graph signals. Our approach not only enhances the robustness of GNNs but also preserves crucial graph features and structures. Through extensive experiments on diverse, real-world graph datasets, we demonstrate that our framelet convolution achieves superior performance in node classification tasks. It exhibits remarkable resilience to noisy data and adversarial attacks, highlighting its potential as a robust solution for real-world graph applications. This advancement opens new avenues for more adaptive and reliable spectral GNN architectures.","PeriodicalId":51327,"journal":{"name":"International Journal of Machine Learning and Cybernetics","volume":"13 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141770180","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Visible-infrared person re-identification with complementary feature fusion and identity consistency learning 利用互补特征融合和身份一致性学习进行可见红外人员再识别

IF 5.6 3区计算机科学

International Journal of Machine Learning and Cybernetics Pub Date : 2024-07-24 DOI: 10.1007/s13042-024-02282-5

Yiming Wang, Xiaolong Chen, Yi Chai, Kaixiong Xu, Yutao Jiang, Bowen Liu

{"title":"Visible-infrared person re-identification with complementary feature fusion and identity consistency learning","authors":"Yiming Wang, Xiaolong Chen, Yi Chai, Kaixiong Xu, Yutao Jiang, Bowen Liu","doi":"10.1007/s13042-024-02282-5","DOIUrl":"https://doi.org/10.1007/s13042-024-02282-5","url":null,"abstract":"The dual-mode 24/7 monitoring systems continuously obtain visible and infrared images in a real scene. However, differences such as color and texture between these cross-modality images pose challenges for visible-infrared person re-identification (ReID). Currently, the general method is modality-shared feature learning or modal-specific information compensation based on style transfer, but the modality differences often result in the inevitable loss of valuable feature information in the training process. To address this issue, A complementary feature fusion and identity consistency learning (CFF-ICL) method is proposed. On the one hand, the multiple feature fusion mechanism based on cross attention is used to promote the features extracted by the two groups of networks in the same modality image to show a more obvious complementary relationship to improve the comprehensiveness of feature information. On the other hand, the designed collaborative adversarial mechanism between dual discriminators and feature extraction network is designed to remove the modality differences, and then construct the identity consistency between visible and infrared images. Experimental results by testing on SYSU-MM01 and RegDB datasets verify the method’s effectiveness and superiority.","PeriodicalId":51327,"journal":{"name":"International Journal of Machine Learning and Cybernetics","volume":"25 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141770210","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Text semantic matching algorithm based on the introduction of external knowledge under contrastive learning 对比学习下基于外部知识引入的文本语义匹配算法

IF 5.6 3区计算机科学

International Journal of Machine Learning and Cybernetics Pub Date : 2024-07-24 DOI: 10.1007/s13042-024-02285-2

Jie Hu, Yinglian Zhu, Lishan Wu, Qilei Luo, Fei Teng, Tianrui Li

{"title":"Text semantic matching algorithm based on the introduction of external knowledge under contrastive learning","authors":"Jie Hu, Yinglian Zhu, Lishan Wu, Qilei Luo, Fei Teng, Tianrui Li","doi":"10.1007/s13042-024-02285-2","DOIUrl":"https://doi.org/10.1007/s13042-024-02285-2","url":null,"abstract":"Measuring the semantic similarity between two texts is a fundamental aspect of text semantic matching. Each word in the texts holds a weighted meaning, and it is essential for the model to effectively capture the most crucial knowledge. However, current text matching methods based on BERT have limitations in acquiring professional domain knowledge. BERT requires extensive domain-specific training data to perform well in specialized fields such as medicine, where obtaining labeled data is challenging. In addition, current text matching models that inject domain knowledge often rely on creating new training tasks to fine-tune the model, which is time-consuming. Although existing works have directly injected domain knowledge into BERT through similarity matrices, they struggle to handle the challenge of small sample sizes in professional fields. Contrastive learning trains a representation learning model by generating instances that exhibit either similarity or dissimilarity, so that a more general representation can be learned with a small number of samples. In this paper, we propose to directly integrate the word similarity matrix into BERT’s multi-head attention mechanism under a contrastive learning framework to align similar words during training. Furthermore, in the context of Chinese medical applications, we propose an entity MASK approach to enhance the understanding of medical terms by pre-trained models. The proposed method helps BERT acquire domain knowledge to better learn text representations in professional fields. Extensive experimental results have shown that the algorithm significantly improves the performance of the text matching model, especially when training data is limited.","PeriodicalId":51327,"journal":{"name":"International Journal of Machine Learning and Cybernetics","volume":"13 1","pages":""},"PeriodicalIF":5.6,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141770181","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0