Neural Networks最新文献

筛选
英文 中文
Deep clustering analysis via variational autoencoder with Gamma mixture latent embeddings. 基于混合隐嵌入的变分自编码器深度聚类分析。
IF 6 1区 计算机科学
Neural Networks Pub Date : 2025-03-01 Epub Date: 2024-12-04 DOI: 10.1016/j.neunet.2024.106979
Jiaxun Guo, Wentao Fan, Manar Amayri, Nizar Bouguila
{"title":"Deep clustering analysis via variational autoencoder with Gamma mixture latent embeddings.","authors":"Jiaxun Guo, Wentao Fan, Manar Amayri, Nizar Bouguila","doi":"10.1016/j.neunet.2024.106979","DOIUrl":"10.1016/j.neunet.2024.106979","url":null,"abstract":"<p><p>This article proposes a novel deep clustering model based on the variational autoencoder (VAE), named GamMM-VAE, which can learn latent representations of training data for clustering in an unsupervised manner. Most existing VAE-based deep clustering methods use the Gaussian mixture model (GMM) as a prior on the latent space. We employ a more flexible asymmetric Gamma mixture model to achieve higher quality embeddings of the data latent space. Second, since the Gamma is defined for strictly positive variables, in order to exploit the reparameterization trick of VAE, we propose a transformation method from Gaussian distribution to Gamma distribution. This method can also be considered a Gamma distribution reparameterization trick, allows gradients to be backpropagated through the sampling process in the VAE. Finally, we derive the evidence lower bound (ELBO) based on the Gamma mixture model in an effective way for the stochastic gradient variational Bayesian (SGVB) estimator to optimize the proposed model. ELBO, a variational inference objective, ensures the maximization of the approximation of the posterior distribution, while SGVB is a method used to perform efficient inference and learning in VAEs. We validate the effectiveness of our model through quantitative comparisons with other state-of-the-art deep clustering models on six benchmark datasets. Moreover, due to the generative nature of VAEs, the proposed model can generate highly realistic samples of specific classes without supervised information.</p>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"183 ","pages":"106979"},"PeriodicalIF":6.0,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142814791","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TENet: Targetness entanglement incorporating with multi-scale pooling and mutually-guided fusion for RGB-E object tracking. 宗旨:结合多尺度池化和相互引导融合的目标纠缠,用于RGB-E目标跟踪。
IF 6 1区 计算机科学
Neural Networks Pub Date : 2025-03-01 Epub Date: 2024-11-27 DOI: 10.1016/j.neunet.2024.106948
Pengcheng Shao, Tianyang Xu, Zhangyong Tang, Linze Li, Xiao-Jun Wu, Josef Kittler
{"title":"TENet: Targetness entanglement incorporating with multi-scale pooling and mutually-guided fusion for RGB-E object tracking.","authors":"Pengcheng Shao, Tianyang Xu, Zhangyong Tang, Linze Li, Xiao-Jun Wu, Josef Kittler","doi":"10.1016/j.neunet.2024.106948","DOIUrl":"10.1016/j.neunet.2024.106948","url":null,"abstract":"<p><p>There is currently strong interest in improving visual object tracking by augmenting the RGB modality with the output of a visual event camera that is particularly informative about the scene motion. However, existing approaches perform event feature extraction for RGB-E tracking using traditional appearance models, which have been optimised for RGB only tracking, without adapting it for the intrinsic characteristics of the event data. To address this problem, we propose an Event backbone (Pooler), designed to obtain a high-quality feature representation that is cognisant of the innate characteristics of the event data, namely its sparsity. In particular, Multi-Scale Pooling is introduced to capture all the motion feature trends within event data through the utilisation of diverse pooling kernel sizes. The association between the derived RGB and event representations is established by an innovative module performing adaptive Mutually Guided Fusion (MGF). Extensive experimental results show that our method significantly outperforms state-of-the-art trackers on two widely used RGB-E tracking datasets, including VisEvent and COESOT, where the precision and success rates on COESOT are improved by 4.9% and 5.2%, respectively. Our code will be available at https://github.com/SSSpc333/TENet.</p>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"183 ","pages":"106948"},"PeriodicalIF":6.0,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142808460","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimating global phase synchronization by quantifying multivariate mutual information and detecting network structure. 通过量化多元互信息和检测网络结构估计全局相位同步。
IF 6 1区 计算机科学
Neural Networks Pub Date : 2025-03-01 Epub Date: 2024-11-28 DOI: 10.1016/j.neunet.2024.106984
Zhaohui Li, Yanyu Xing, Xinyan Wang, Yunlu Cai, Xiaoxia Zhou, Xi Zhang
{"title":"Estimating global phase synchronization by quantifying multivariate mutual information and detecting network structure.","authors":"Zhaohui Li, Yanyu Xing, Xinyan Wang, Yunlu Cai, Xiaoxia Zhou, Xi Zhang","doi":"10.1016/j.neunet.2024.106984","DOIUrl":"10.1016/j.neunet.2024.106984","url":null,"abstract":"<p><p>In neuroscience, phase synchronization (PS) is a crucial mechanism that facilitates information processing and transmission between different brain regions. Specifically, global phase synchronization (GPS) characterizes the degree of PS among multivariate neural signals. In recent years, several GPS methods have been proposed. However, they primarily focus on the collective synchronization behavior of multivariate neural signals, while neglecting the structural difference between oscillator networks. Therefore, in this paper, we introduce a method named total correlation-based synchronization (TCS) to quantify GPS intensity by examining network organization. To evaluate the performance of TCS, we conducted simulations using the Rössler model and compared it to three existing methods: circular omega complexity, hyper-torus synchrony, and symbolic phase difference and permutation entropy. The results indicate that TCS outperforms the other methods at distinguishing the GPS intensity between networks with similar structures. And it offers insight into the separation and integration behavior of signals during synchronization. Furthermore, to validate this method with experimental data, TCS was applied to analyze the GPS variation of multichannel stereo-electroencephalography (SEEG) signals recorded from onset zones of patients with temporal lobe epilepsy. It was observed that the termination of seizures was associated with the increased GPS and the integration of brain regions. Taken together, TCS offers an alternative way to measure GPS of multivariate signals, which may shed new lights on the mechanism of brain functions and neurological disorders, such as learning, memory, epilepsy, and Alzheimer's disease.</p>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"183 ","pages":"106984"},"PeriodicalIF":6.0,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142773941","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CDCGAN: Class Distribution-aware Conditional GAN-based minority augmentation for imbalanced node classification. CDCGAN:基于类分布感知的条件gan的非平衡节点分类的少数增强。
IF 6 1区 计算机科学
Neural Networks Pub Date : 2025-03-01 Epub Date: 2024-11-29 DOI: 10.1016/j.neunet.2024.106933
Bojia Liu, Conghui Zheng, Fuhui Sun, Xiaoyan Wang, Li Pan
{"title":"CDCGAN: Class Distribution-aware Conditional GAN-based minority augmentation for imbalanced node classification.","authors":"Bojia Liu, Conghui Zheng, Fuhui Sun, Xiaoyan Wang, Li Pan","doi":"10.1016/j.neunet.2024.106933","DOIUrl":"10.1016/j.neunet.2024.106933","url":null,"abstract":"<p><p>Node classification is a fundamental task of Graph Neural Networks (GNNs). However, GNN models tend to suffer from the class imbalance problem which deteriorates the representation ability of minority classes, thus leading to unappealing classification performance. The most straightforward and effective solution is to augment the minority samples for balancing the representations of majority and minority classes. Previous methods leverage a limited number of labeled nodes to generate new samples, without considering the overall class characteristics and failing to reflect the underlying class distributions. Besides, they often yield less distinguishable nodes that cannot represent their original classes well, because they may incorporate useless information from other classes to form node representations. To address this issue, we propose a Class Distribution-aware Conditional Generative Adversarial Network (CDCGAN) to generate diverse and distinguishable minority nodes based on their class distribution characteristics. Specifically, we extract the node embeddings and class distributions while preserving the topology and attribute information, thus capturing the overall class characteristics. Then, the obtained class distributions are used to design a conditional generator, which incorporates nonlinear transformations to generate diverse minority nodes and leverages adversarial learning to maintain intrinsic class distribution characteristics. At last, to ensure the distinguishability of node representations, a unique discriminator is implemented to jointly discriminate and classify nodes of the augmented graph. Extensive experiments conducted on six datasets demonstrate that the proposed CDCGAN outperforms the state-of-the-art methods on widely used evaluation metrics. The source code is available at https://github.com/Crystal-LiuBojia/CDCGAN.</p>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"183 ","pages":"106933"},"PeriodicalIF":6.0,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142787483","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimization control for mean square synchronization of stochastic semi-Markov jump neural networks with non-fragile hidden information and actuator saturation. 具有非脆弱隐藏信息和致动器饱和的随机半马尔可夫跳变神经网络均方同步优化控制。
IF 6 1区 计算机科学
Neural Networks Pub Date : 2025-03-01 Epub Date: 2024-11-23 DOI: 10.1016/j.neunet.2024.106942
Zou Yang, Jun Wang, Kaibo Shi, Xiao Cai, Sheng Han
{"title":"Optimization control for mean square synchronization of stochastic semi-Markov jump neural networks with non-fragile hidden information and actuator saturation.","authors":"Zou Yang, Jun Wang, Kaibo Shi, Xiao Cai, Sheng Han","doi":"10.1016/j.neunet.2024.106942","DOIUrl":"10.1016/j.neunet.2024.106942","url":null,"abstract":"<p><p>This paper studies the asynchronous output feedback control and H<sub>∞</sub> synchronization problems for a class of continuous-time stochastic hidden semi-Markov jump neural networks (SMJNNs) affected by actuator saturation. Initially, a novel neural networks (NNs) model is constructed, incorporating semi-Markov process (SMP), hidden information, and Brownian motion to accurately simulate the complexity and uncertainty of real-world environments. Secondly, acknowledging system mode mismatches and the need for robust anti-interference capabilities, a non-fragile controller based on hidden information is proposed. The designed controller effectively mitigates the impact of uncertainties enhancing system reliability. Furthermore, sufficient conditions for stochastic mean square synchronization (MSS) within the domain of attraction are provided, and optimal control is achieved through the construction of a Lyapunov function based on SMP. Finally, the feasibility of the proposed method is verified through numerical examples.</p>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"183 ","pages":"106942"},"PeriodicalIF":6.0,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142792280","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
OperaGAN: A simultaneous transfer network for opera makeup and complex headwear. OperaGAN:用于歌剧化妆和复杂头饰的同步传输网络。
IF 6 1区 计算机科学
Neural Networks Pub Date : 2025-03-01 Epub Date: 2024-12-09 DOI: 10.1016/j.neunet.2024.107015
Yue Ma, Chunjie Xu, Wei Song, Hanyu Liang
{"title":"OperaGAN: A simultaneous transfer network for opera makeup and complex headwear.","authors":"Yue Ma, Chunjie Xu, Wei Song, Hanyu Liang","doi":"10.1016/j.neunet.2024.107015","DOIUrl":"10.1016/j.neunet.2024.107015","url":null,"abstract":"<p><p>Standard makeup transfer techniques mainly focus on facial makeup. The texture details of headwear in style examples tend to be ignored. When dealing with complex portrait style transfer, simultaneous correct headwear and facial makeup transfer often cannot be guaranteed. In this paper, we construct the Peking Opera makeup dataset and propose a makeup transfer network for Opera faces called OperaGAN. This network consists of two key components: the Makeup and Headwear Style Encoder module (MHSEnc) and the Identity Coding and Makeup Fusion module (ICMF). MHSEnc is specifically designed to extract the style features from global and local perspectives. ICMF extracts the source image's facial features and combines them with the style features to generate the final transfer result. In addition, multiple overlapping local discriminators are utilized to transfer the high-frequency details in opera makeup. Experiments demonstrate that our method achieves state-of-the-art results in simultaneously transferring opera makeup and headwear. And the method can transfer headwear with missing content and controllable intensity makeup. The code and dataset will be available at https://github.com/Ivychun/OperaGAN.</p>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"183 ","pages":"107015"},"PeriodicalIF":6.0,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142830605","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-hop interpretable meta learning for few-shot temporal knowledge graph completion. 基于多跳可解释元学习的短时知识图补全。
IF 6 1区 计算机科学
Neural Networks Pub Date : 2025-03-01 Epub Date: 2024-11-28 DOI: 10.1016/j.neunet.2024.106981
Luyi Bai, Shuo Han, Lin Zhu
{"title":"Multi-hop interpretable meta learning for few-shot temporal knowledge graph completion.","authors":"Luyi Bai, Shuo Han, Lin Zhu","doi":"10.1016/j.neunet.2024.106981","DOIUrl":"10.1016/j.neunet.2024.106981","url":null,"abstract":"<p><p>Multi-hop path completion is a key part of temporal knowledge graph completion, which aims to infer complex relationships and obtain interpretable completion results. However, the traditional multi-hop path completion models mainly focus on the static knowledge graph with sufficient relationship instances, and does not consider the impact of timestamp information on the completion path, and is not suitable for few-shot relations. These limitations make the performance of these models not good when dealing with few-shot relationships in temporal knowledge graphs. In order to issue these challenges, we propose the Few-shot Temporal knowledge graph completion model based on the Multi-hop Interpretable meta-learning(FTMI). First, by aggregating the multi-hop neighbor information of the task relationship to generate a time-aware entity representation to enhance the task entity representation, the introduction of the timestamp information dimension enables the FTMI model to understand and deal with the impact of time changes on entities and relationships. In addition, time-aware entity pair representations are encoded using Transformer. At the same time, the specific representation of task relationship is generated by means of mean pooling layer aggregation. In addition, the model applies the reinforcement learning framework to the whole process of multi-hop path completion, constructs the strategy network, designs the new reward function to achieve the balance between path novelty and length, and helps Agent find the optimal path, thus realizing the completion of the temporal knowledge graph with few samples. In the training process, meta-learning is used to enable the model to quickly adapt to new tasks in the case of few samples. A huge number of experiments were carried out on two datasets to validate the model's validity.</p>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"183 ","pages":"106981"},"PeriodicalIF":6.0,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142773971","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Large Language Model Enhanced Logic Tensor Network for Stance Detection. 面向姿态检测的大语言模型增强逻辑张量网络。
IF 6 1区 计算机科学
Neural Networks Pub Date : 2025-03-01 Epub Date: 2024-11-29 DOI: 10.1016/j.neunet.2024.106956
Genan Dai, Jiayu Liao, Sicheng Zhao, Xianghua Fu, Xiaojiang Peng, Hu Huang, Bowen Zhang
{"title":"Large Language Model Enhanced Logic Tensor Network for Stance Detection.","authors":"Genan Dai, Jiayu Liao, Sicheng Zhao, Xianghua Fu, Xiaojiang Peng, Hu Huang, Bowen Zhang","doi":"10.1016/j.neunet.2024.106956","DOIUrl":"10.1016/j.neunet.2024.106956","url":null,"abstract":"<p><p>Social media platforms, rich in user-generated content, offer a unique perspective on public opinion, making stance detection an essential task in opinion mining. However, traditional deep neural networks for stance detection often suffer from limitations, including the requirement for large amounts of labeled data, uninterpretability of prediction results, and difficulty in incorporating human intentions and domain knowledge. This paper introduces the First-Order Logic Aggregated Reasoning framework (FOLAR), an innovative approach that integrates first-order logic (FOL) with large language models (LLMs) to enhance the interpretability and efficacy of stance detection. FOLAR comprises three key components: a Knowledge Elicitation module that generates FOL rules using a chain-of-thought prompting method, a Logic Tensor Network (LTN) that encodes these rules for stance detection, and a Multi-Decision Fusion mechanism that aggregates LTNs' outputs to minimize biases and improve robustness. Our experiments on standard benchmarks demonstrate the effectiveness of FOLAR, showing it as a promising solution for explainable and accurate stance detection. The source code will be made publicly available to foster further research.</p>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"183 ","pages":"106956"},"PeriodicalIF":6.0,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142787486","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Plane coexistence behaviors for Hopfield neural network with two-memristor-interconnected neurons. 具有双神经元互联的 Hopfield 神经网络的平面共存行为
IF 6 1区 计算机科学
Neural Networks Pub Date : 2025-03-01 Epub Date: 2024-12-12 DOI: 10.1016/j.neunet.2024.107049
Fangyuan Li, Wangsheng Qin, Minqi Xi, Lianfa Bai, Bocheng Bao
{"title":"Plane coexistence behaviors for Hopfield neural network with two-memristor-interconnected neurons.","authors":"Fangyuan Li, Wangsheng Qin, Minqi Xi, Lianfa Bai, Bocheng Bao","doi":"10.1016/j.neunet.2024.107049","DOIUrl":"10.1016/j.neunet.2024.107049","url":null,"abstract":"<p><p>Memristors are commonly used as the connecting parts of neurons in brain-like neural networks. The memristors, unlike the existing literature, possess the capability to function as both self-connected synaptic weights and interconnected synaptic weights, thereby enabling the generation of intricate initials-regulated plane coexistence behaviors. To demonstrate this dynamical effect, a Hopfield neural network with two-memristor-interconnected neurons (TMIN-HNN) is proposed. On this basis, the stability distribution of the equilibrium points is analyzed, the related bifurcation behaviors are studied by utilizing some numerical simulation methods, and the plane coexistence behaviors are proved theoretically and revealed numerically. The results clarify that TMIN-HNN not only exhibits complex bifurcation behaviors, but also has initials-regulated plane coexistence behaviors. In particular, the coexistence attractors can be switched to different plane locations by the initial states of the two memristors. Finally, a digital experiment device is developed based on STM32 hardware board to verify the initials-regulated plane coexistence attractors.</p>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"183 ","pages":"107049"},"PeriodicalIF":6.0,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142866113","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improved fractional-order gradient descent method based on multilayer perceptron. 基于多层感知器的改进分数阶梯度下降方法。
IF 6 1区 计算机科学
Neural Networks Pub Date : 2025-03-01 Epub Date: 2024-12-01 DOI: 10.1016/j.neunet.2024.106970
Xiaojun Zhou, Chunna Zhao, Yaqun Huang, Chengli Zhou, Junjie Ye
{"title":"Improved fractional-order gradient descent method based on multilayer perceptron.","authors":"Xiaojun Zhou, Chunna Zhao, Yaqun Huang, Chengli Zhou, Junjie Ye","doi":"10.1016/j.neunet.2024.106970","DOIUrl":"10.1016/j.neunet.2024.106970","url":null,"abstract":"<p><p>The fractional-order gradient descent (FOGD) method has been employed by numerous scholars in Artificial Neural Networks (ANN), with its superior performance validated both theoretically and experimentally. However, current FOGD methods only apply fractional-order differentiation to the loss function. The application of FOGD based on Autograd to hidden layers leverages the characteristics of fractional-order differentiation, significantly enhancing its flexibility. Moreover, the implementation of FOGD in the hidden layers serves as a necessary foundation for establishing a family of fractional-order deep learning optimizers, facilitating the widespread application of FOGD in deep learning. This paper proposes an improved fractional-order gradient descent (IFOGD) method based on Multilayer Perceptron (MLP). Firstly, a fractional matrix differentiation algorithm and its fractional matrix differentiation solver is proposed based on MLP, ensuring that IFOGD can be applied within the hidden layers. Subsequently, we overcome the issue of incorrect backpropagation direction caused by the absolute value symbol, ensuring that the IFOGD method does not cause divergence in the value of the loss function. Thirdly, fractional-order Autograd (FOAutograd) is proposed based on PyTorch by reconstructing Linear layer and Mean Squared Error Loss module. By combining FOAutograd with first-order adaptive deep learning optimizers, parameter matrices in each layer of ANN can be updated using fractional-order gradients. Finally, we compare and analyze the performance of IFOGD with other methods in simulation experiments and time series prediction tasks. The experimental results demonstrate that the IFOGD method exhibits performances.</p>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"183 ","pages":"106970"},"PeriodicalIF":6.0,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142792265","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信