Neural Computing and Applications最新文献_第8页

Computational and artificial neural network study on ternary nanofluid flow with heat and mass transfer with magnetohydrodynamics and mass transpiration 三元纳米流体流动的计算与人工神经网络研究--磁流体力学与质量蒸腾的传热传质关系

Neural Computing and Applications Pub Date : 2024-08-19 DOI: 10.1007/s00521-024-10325-9

U. S. Mahabaleshwar, K. M. Nihaal, Dia Zeidan, T. Dbouk, D. Laroze

{"title":"Computational and artificial neural network study on ternary nanofluid flow with heat and mass transfer with magnetohydrodynamics and mass transpiration","authors":"U. S. Mahabaleshwar, K. M. Nihaal, Dia Zeidan, T. Dbouk, D. Laroze","doi":"10.1007/s00521-024-10325-9","DOIUrl":"https://doi.org/10.1007/s00521-024-10325-9","url":null,"abstract":"Ternary nanofluids have been an interesting field for academics and researchers in the modern technological era because of their advanced thermophysical properties and the desire to increase heat transfer rates. Furthermore, the innovative, sophisticated artificial neural network strategy with the Levenberg–Marquardt backpropagation technique (LMBPT) is proposed for research on heat and mass transport over non-Newtonian ternary Casson fluid on a radially extending surface with magnetic field and convective boundary conditions. The main objective of the current research is to conduct a comparative study of numerical solutions of the ternary nanofluid model of heat/mass transport utilizing the artificial neural network (ANN) together with the (LMBPT). To accurately represent complex patterns, neural networks modify their parameters flexibly, resulting in more accurate predictions and greater generalization with numerical outcomes. The model equations were reduced from partial to ODEs through applying appropriate similarity variables. The shooting technique and the byp-4c algorithm were then used to analyze the numerical data. The current study reveals that a rise in the Casson parameter diminishes the fluid velocity but an opposite nature is seen in thermal distribution for rising behavior of heat source/sink and Biot number, and the concentration profile tends to deteriorate when the mass transfer is elevated. Furthermore, the resulting values of the significant engineering coefficients are numerically analyzed and tabulated.","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"45 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142188350","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A reduced-form multigrid approach for ANN equivalent to classic multigrid expansion 等同于经典多网格扩展的简化形式多网格 ANN 方法

Neural Computing and Applications Pub Date : 2024-08-19 DOI: 10.1007/s00521-024-10311-1

Jeong-Kweon Seo

{"title":"A reduced-form multigrid approach for ANN equivalent to classic multigrid expansion","authors":"Jeong-Kweon Seo","doi":"10.1007/s00521-024-10311-1","DOIUrl":"https://doi.org/10.1007/s00521-024-10311-1","url":null,"abstract":"In this paper, we investigate the method of solving partial differential equations (PDEs) using artificial neural network (ANN) structures, which have been actively applied in artificial intelligence models. The ANN model for solving PDEs offers the advantage of providing explicit and continuous solutions. However, the ANN model for solving PDEs cannot construct a conventionally solvable linear system with known matrix solvers; thus, computational speed could be a significant concern. We study the implementation of the multigrid method, developing a general concept for a coarse-grid correction method to be integrated into the ANN-PDE architecture, with the goal of enhancing computational efficiency. By developing a reduced form of the multigrid method for ANN, we demonstrate that it can be interpreted as an equivalent representation of the classic multigrid expansion. We validated the applicability of the proposed method through rigorous experiments, which included analyzing loss decay and the number of iterations along with improvements in terms of accuracy, speed, and complexity. We accomplished this by employing the gradient descent method and the Broyden–Fletcher–Goldfarb–Shanno (BFGS) method to update the gradients while solving the given ANN systems of PDEs.","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"38 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142188360","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Extensive evaluation of image classifiers’ interpretations 对图像分类器的解释进行广泛评估

Neural Computing and Applications Pub Date : 2024-08-16 DOI: 10.1007/s00521-024-10273-4

Suraja Poštić, Marko Subašić

{"title":"Extensive evaluation of image classifiers’ interpretations","authors":"Suraja Poštić, Marko Subašić","doi":"10.1007/s00521-024-10273-4","DOIUrl":"https://doi.org/10.1007/s00521-024-10273-4","url":null,"abstract":"Saliency maps are input-resolution matrices used for visualizing local interpretations of image classifiers. Their pixel values reflect the importance of corresponding image locations for the model’s decision. Despite numerous proposals on how to obtain such maps, their evaluation remains an open question. This paper presents a carefully designed experimental procedure along with a set of quantitative interpretation evaluation metrics that rely solely on the original model behavior. Previously noticed evaluation biases have been attenuated by separating locations with high and low values, considering the full saliency map resolution, and using classifiers with diverse accuracies and all the classes in the dataset. We used the proposed evaluation metrics to compare and analyze seven well-known interpretation methods. Our experiments confirm the importance of object background as well as negative saliency map pixels, and we show that the scale of their impact on the model is comparable to that of positive ones. We also demonstrate that a good class score interpretation does not necessarily imply a good probability interpretation. DeepLIFT and LRP-(epsilon) methods proved most successful altogether, while Grad-CAM and Ablation-CAM performed very poorly, even in the detection of positive relevance. The retention of positive values alone in the latter two methods was responsible for the inaccurate detection of irrelevant locations as well.","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"8 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142188397","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

DONN: leveraging heterogeneous outer products for CTR prediction DONN：利用异构外部产品进行点击率预测

Neural Computing and Applications Pub Date : 2024-08-16 DOI: 10.1007/s00521-024-10296-x

Tae-Suk Kim

{"title":"DONN: leveraging heterogeneous outer products for CTR prediction","authors":"Tae-Suk Kim","doi":"10.1007/s00521-024-10296-x","DOIUrl":"https://doi.org/10.1007/s00521-024-10296-x","url":null,"abstract":"A primary strategy for constructing click-through rate models based on deep learning involves combining a multi-layer perceptron (MLP) with custom networks that can effectively capture the interactions between different features. This is due to the widespread recognition that relying solely on a vanilla MLP network is not effective in acquiring knowledge about multiplicative feature interactions. These custom networks often employ product methods, such as inner, Hadamard, and outer products, to construct dedicated architectures for this purpose. Among these methods, the outer product has shown superiority in capturing feature interactions. However, the resulting quadratic form from the outer product operation limits the conveyance of informative higher-order interactions to the MLP. Efforts to address this limitation have led to models attempting to increase interaction degrees to higher orders. However, utilizing matrix factorization techniques to reduce learning parameters has resulted in information loss and decreased performance. Furthermore, previous studies have constrained the MLP’s potential by providing it with inputs consisting of homogeneous outer products, thus limiting available information diversity. To overcome these challenges, we introduce DONN, a model that leverages a composite-wise bilinear module incorporating factorized bilinear pooling to mitigate information loss and facilitate higher-order interaction development. Additionally, DONN utilizes a feature-wise bilinear module for outer product computations between feature pairs, augmenting the MLP with combined information. By employing heterogeneous outer products, DONN enhances the MLP’s prediction capabilities, enabling the recognition of additional nonlinear interdependencies. Our evaluation on two benchmark datasets demonstrates that DONN surpasses state-of-the-art models in terms of performance.","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"393 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142188355","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Enhancing human-like multimodal reasoning: a new challenging dataset and comprehensive framework 增强类人多模态推理：新的挑战性数据集和综合框架

Neural Computing and Applications Pub Date : 2024-08-16 DOI: 10.1007/s00521-024-10310-2

Jingxuan Wei, Cheng Tan, Zhangyang Gao, Linzhuang Sun, Siyuan Li, Bihui Yu, Ruifeng Guo, Stan Z. Li

{"title":"Enhancing human-like multimodal reasoning: a new challenging dataset and comprehensive framework","authors":"Jingxuan Wei, Cheng Tan, Zhangyang Gao, Linzhuang Sun, Siyuan Li, Bihui Yu, Ruifeng Guo, Stan Z. Li","doi":"10.1007/s00521-024-10310-2","DOIUrl":"https://doi.org/10.1007/s00521-024-10310-2","url":null,"abstract":"Multimodal reasoning is a critical component in the pursuit of artificial intelligence systems that exhibit human-like intelligence, especially when tackling complex tasks. While the chain-of-thought (CoT) technique has gained considerable attention, the existing ScienceQA dataset, primarily focused on multimodal scientific questions and explanations from elementary and high school textbooks, exhibits limitations in providing a comprehensive evaluation across a broader spectrum of open-domain questions. To address this gap, we introduce the COCO Multi-Modal Reasoning (COCO-MMR) dataset, a comprehensive collection of open-ended questions, rationales, and answers derived from the COCO dataset. Unlike previous datasets that rely on multiple-choice questions, our dataset utilizes open-ended questions to more effectively challenge and assess CoT models’ reasoning capabilities. Through comprehensive evaluations and detailed analyses, we demonstrate that our multihop cross-modal attention and sentence-level contrastive learning modules, designed to simulate human thought processes, significantly enhance model comprehension abilities. Experiments confirm the proposed dataset and techniques, showing their potential to advance multimodal reasoning. The data and code are available at https://github.com/weijingxuan/COCO-MMR.","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"23 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142188401","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Automated evaluation and parameter estimation of brain tumor using deep learning techniques 利用深度学习技术对脑肿瘤进行自动评估和参数估计

Neural Computing and Applications Pub Date : 2024-08-16 DOI: 10.1007/s00521-024-10255-6

B. Vijayakumari, N. Kiruthiga, C. P. Bushkala

{"title":"Automated evaluation and parameter estimation of brain tumor using deep learning techniques","authors":"B. Vijayakumari, N. Kiruthiga, C. P. Bushkala","doi":"10.1007/s00521-024-10255-6","DOIUrl":"https://doi.org/10.1007/s00521-024-10255-6","url":null,"abstract":"The identification and region extraction of brain tumors is an essential aspect of clinical image analysis and the diagnosis of brain-related illnesses. The precise and accurate identification of tumors from MRI images is particularly significant in the effective formulating of treatments such as surgery, radiation therapy, and drug therapy. The challenge of segmentation stems from the variability in the size, location, and appearance of tumors, making it a complex task. Various segmentation and classification techniques have been created and designed for brain tumor diagnosis; however, these traditional techniques are time-consuming and subjective and require expertise in image processing. In recent times, deep learning-based approaches have shown promising results in brain tumor segmentation. This research aims to develop a brain tumor segmentation and classification model that enables medical professionals to locate and measure tumors accurately and develop effective treatment and rehabilitation strategies. The process involves segmenting the tumor and further classifying it into its two major types. The parameter estimation from the segmented output provides an insight that is pivotal in the evaluation of MRI brain tumors. With further research and development, deep learning-based segmentation and classification could become an important tool for accurate detection and evaluation of brain tumors. The development of deep learning-based segmentation and classification methods can greatly benefit the medical community, and according to the finding from the experiment, it is shown that the proposed framework excels in brain tumor segmentation and classification with an accuracy of 99.3%.","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"20 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142188356","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Simulation of the behavior of fine and gross motor skills of an individual with motor disabilities 模拟运动残疾者的精细和粗大运动技能行为

Neural Computing and Applications Pub Date : 2024-08-16 DOI: 10.1007/s00521-024-10267-2

Karla K. Sánchez-Torres, Suemi Rodríguez-Romo

{"title":"Simulation of the behavior of fine and gross motor skills of an individual with motor disabilities","authors":"Karla K. Sánchez-Torres, Suemi Rodríguez-Romo","doi":"10.1007/s00521-024-10267-2","DOIUrl":"https://doi.org/10.1007/s00521-024-10267-2","url":null,"abstract":"We have developed a neural network model that imitates the central nervous system’s control of motor sensors (Sánchez-Torres and Rodríguez-Romo in Neurocomputing 581:127511, 2024). Our research explored various levels of connectivity in our neural network related to neuroplasticity in the central nervous system. We have conducted a study comparing healthy individuals to those with motor impairments by utilizing reinforcement learning and transfer entropy. In our previous research (Sánchez-Torres and Rodríguez-Romo in Neurocomputing 581:127511, 2024), we have simulated human walking while encountering obstacles as an instance of gross motor activities. Now, we have used the same model to simulate fine motor activities. Our goal is to identify differences in information transmission between gross and fine motor activities among healthy individuals and those with motor impairments by evaluating the effective connectivity of our network. To regulate learning accuracy in our model, we introduced a variable called numClusterToFire. However, we discovered that the value for this variable requires careful calibration. If the value is too small, agent exploration is insufficient, and network learning is inefficient. Conversely, learning times increase exponentially, often unnecessarily if the value is too large. We conducted simulations for gross and fine motor skills using three different numClusterToFire values and found that as we increased numClusterToFire, the time required for the network to memorize the outputs for each of the objects in the test set also increased. Our findings indicate that in gross motor skills, which do not require precision, changes in the numClusterToFire variable do not affect information transfer behavior. Conversely, in fine motor skills, information transfer decreases as numClusterToFire increases. On the other hand, our model revealed that for healthy and disabled individuals, the transfer of information between the input layer and the first hidden layer is higher for fine motor skills; this important biological fact suggests the influence of external cues in performing this activity successfully. Additionally, our neural network model showed that movements that do not require precision do not necessarily require a high level of neuroplasticity. Increasing neuroplasticity may cause some neurons to transmit more information than others. Whereas, increasing neuroplasticity through practice is essential for precise movements like fine motor skills. We also found that information transfer in the network’s hidden layers is similar for fine and gross motor activities, as we observed identical patterns. However, the distribution and proportion of these patterns differ, concluding that more neurons are involved in fine motor activities, and more information is transferred compared to gross motor activities. Finally, a pattern was observed in the transfer of information in the last hidden lay","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"37 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142188359","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A proposed framework for crop yield prediction using hybrid feature selection approach and optimized machine learning 利用混合特征选择方法和优化机器学习进行作物产量预测的拟议框架

Neural Computing and Applications Pub Date : 2024-08-16 DOI: 10.1007/s00521-024-10226-x

Mahmoud Abdel-salam, Neeraj Kumar, Shubham Mahajan

{"title":"A proposed framework for crop yield prediction using hybrid feature selection approach and optimized machine learning","authors":"Mahmoud Abdel-salam, Neeraj Kumar, Shubham Mahajan","doi":"10.1007/s00521-024-10226-x","DOIUrl":"https://doi.org/10.1007/s00521-024-10226-x","url":null,"abstract":"Accurately predicting crop yield is essential for optimizing agricultural practices and ensuring food security. However, existing approaches often struggle to capture the complex interactions between various environmental factors and crop growth, leading to suboptimal predictions. Consequently, identifying the most important feature is vital when leveraging Support Vector Regressor (SVR) for crop yield prediction. In addition, the manual tuning of SVR hyperparameters may not always offer high accuracy. In this paper, we introduce a novel framework for predicting crop yields that address these challenges. Our framework integrates a new hybrid feature selection approach with an optimized SVR model to enhance prediction accuracy efficiently. The proposed framework comprises three phases: preprocessing, hybrid feature selection, and prediction phases. In preprocessing phase, data normalization is conducted, followed by an application of K-means clustering in conjunction with the correlation-based filter (CFS) to generate a reduced dataset. Subsequently, in the hybrid feature selection phase, a novel hybrid FMIG-RFE feature selection approach is proposed. Finally, the prediction phase introduces an improved variant of Crayfish Optimization Algorithm (COA), named ICOA, which is utilized to optimize the hyperparameters of SVR model thereby achieving superior prediction accuracy along with the novel hybrid feature selection approach. Several experiments are conducted to assess and evaluate the performance of the proposed framework. The results demonstrated the superior performance of the proposed framework over state-of-art approaches. Furthermore, experimental findings regarding the ICOA optimization algorithm affirm its efficacy in optimizing the hyperparameters of SVR model, thereby enhancing both prediction accuracy and computational efficiency, surpassing existing algorithms.","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"23 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142188439","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A comprehensive review of hybrid AC/DC networks: insights into system planning, energy management, control, and protection 交直流混合网络综合评述：对系统规划、能源管理、控制和保护的见解

Neural Computing and Applications Pub Date : 2024-08-16 DOI: 10.1007/s00521-024-10264-5

Mohamed I. Abdelwanis, Mohammed I. Elmezain

{"title":"A comprehensive review of hybrid AC/DC networks: insights into system planning, energy management, control, and protection","authors":"Mohamed I. Abdelwanis, Mohammed I. Elmezain","doi":"10.1007/s00521-024-10264-5","DOIUrl":"https://doi.org/10.1007/s00521-024-10264-5","url":null,"abstract":"The introduction of hybrid alternating current (AC)/direct current (DC) distribution networks led to several developments in smart grid and decentralized power system technology. The paper concentrates on several topics related to the operation of hybrid AC/DC networks. Such as optimization methods, control strategies, energy management, protection issues, and proposed solutions. The implementation of neural network optimization methods has great importance for the successful integration of multiple energy sources, dynamic energy management, establishment of system stability and reliability, power distribution optimization, management of energy storage, and online fault detection and diagnosis in hybrid networks like the hybrid AC–DC microgrids (MG). Taking advantage of renewable energy generation and cost-cutting through the neural network optimization technique holds the key to these progressions. Besides identifying the challenges in the operation of a hybrid system, the paper also compares this system to conventional MGs and shows the benefits of this type of system over different MG structures. This review compares the different topologies, particularly looking at the AC–DC coupled hybrid MGs, and shows the important role of the interlinking of converters that are used for efficient transmission between AC and DC MGs and generally used to implement the different control and optimization techniques. Overall, this review paper can be regarded as a reference, pointing out the pros and cons of integrating hybrid AC/DC distribution networks for future study and improvement paths in this developing area.","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"33 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142188357","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Circuit topology aware GNN-based multi-variable model for DC-DC converters dynamics prediction in CCM and DCM 基于 GNN 的电路拓扑感知多变量模型，用于 CCM 和 DCM 中 DC-DC 转换器的动态预测

Neural Computing and Applications Pub Date : 2024-08-16 DOI: 10.1007/s00521-024-10293-0

Ahmed K. Khamis, Mohammed Agamy

引用次数: 0