Faiza Ajmi , Faten Ajmi , Sarah Ben Othman , Hayfa Zgaya-Biau , Mariagrazia Dotoli , Jean-Marie Renard , Slim Hammadi
{"title":"Real time patient scheduling orchestration for improving key performance indicators in a hospital emergency department","authors":"Faiza Ajmi , Faten Ajmi , Sarah Ben Othman , Hayfa Zgaya-Biau , Mariagrazia Dotoli , Jean-Marie Renard , Slim Hammadi","doi":"10.1016/j.jocs.2024.102422","DOIUrl":"10.1016/j.jocs.2024.102422","url":null,"abstract":"<div><p>Healthcare systems worldwide are increasingly subject to in-depth analysis. Problems in healthcare systems are of concern to the general public. For example, overcrowding in emergency departments creates several issues including longer waiting times, more frequent medical errors, a longer length of stay and worsened performance indicators. Overcrowding situations reduce the availability of staff and material resources, and therefore deteriorate the quality of care. The main cause of the overcrowding in emergency departments is the permanent interferences between the scheduled patients, unscheduled patients and urgent and unscheduled patients arriving at the emergency department. The objective of the present study is to develop an innovative decision support system that minimizes these interferences, while taking into account the perturbations that can occur throughout the day. The research’s ultimate goal is to improve the performance indicators via two processes: the first is a memetic algorithm based on a four dimensional hypercube genetic algorithm and local search techniques, and the second is based on a multi-agent system which dynamically orchestrates the patient pathway (given by the scheduling algorithm). In order to test and validate our approach, experiments are designed with real data from the adult emergency department at Lille University Medical Center. Simulations showed that with our approach we were able to reduce the waiting time of patients by 28.12%.</p></div>","PeriodicalId":48907,"journal":{"name":"Journal of Computational Science","volume":"82 ","pages":"Article 102422"},"PeriodicalIF":3.1,"publicationDate":"2024-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142058323","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Investigating the interaction between EEG and fNIRS: A multimodal network analysis of brain connectivity","authors":"Rosmary Blanco , Cemal Koba , Alessandro Crimi","doi":"10.1016/j.jocs.2024.102416","DOIUrl":"10.1016/j.jocs.2024.102416","url":null,"abstract":"<div><p>The brain is a complex system with functional and structural networks. Different neuroimaging methods have been developed to explore these networks, but each method has its own unique strengths and limitations, depending on the signals they measure. Combining techniques like electroencephalography (EEG) and functional near-infrared spectroscopy (fNIRS) has gained interest, but understanding how the information derived from these modalities is related to each other remains an exciting open question. The multilayer network model has emerged as a promising approach to integrate different sources data. In this study, we investigated the hemodynamic and electrophysiological data captured by fNIRS and EEG to compare brain network topologies derived from each modality, examining how these topologies vary between resting state (RS) and task-related conditions. Additionally, we adopted the multilayer network model to integrate EEG and fNIRS data and evaluate the benefits of combining multiple modalities compared to using a single modality in capturing characteristic brain functioning.</p><p>A small-world network structure was observed in the rest, right motor imagery, and left motor imagery tasks in both modalities. We found that EEG captures faster changes in neural activity, thus providing a more precise estimation of the timing of information transfer between brain regions in RS. fNIRS provides insights into the slower hemodynamic responses associated with longer-lasting and sustained neural processes in cognitive tasks. The multilayer approach outperformed unimodal analyses, offering a richer understanding of brain function. Complementarity between EEG and fNIRS was observed, particularly during tasks, as well as a certain level of redundancy and complementarity between the multimodal and the unimodal approach, which depends on the modality and the specific brain state. Overall, the results highlight differences in how EEG and fNIRS capture brain network topology in RS and tasks and emphasize the value of integrating multiple modalities for a comprehensive view of brain connectivity and function.</p></div>","PeriodicalId":48907,"journal":{"name":"Journal of Computational Science","volume":"82 ","pages":"Article 102416"},"PeriodicalIF":3.1,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1877750324002096/pdfft?md5=71eed64dc88649cf2f98e5b6d2bb1fed&pid=1-s2.0-S1877750324002096-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142049119","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Some notes on the basic concepts of support vector machines","authors":"Yongping Wang , Wenjing Liao , Hongting Shen , Zilong Jiang , Jincheng Zhou","doi":"10.1016/j.jocs.2024.102390","DOIUrl":"10.1016/j.jocs.2024.102390","url":null,"abstract":"<div><p>Support vector machines (SVMs) are classic binary classification algorithms and have been shown to be a robust and well-behaved technique for classification in many real-world problems. However, there are ambiguities in the basic concepts of SVMs although these ambiguities do not affect the effectiveness of SVMs. Corinna Cortes and Vladimir Vapnik, who presented SVMs in 1995, pointed out that an SVM predicts through a hyperplane with a maximal margin. However existing literatures have two different definitions of the margin. On the other hand, Corinna Cortes and Vladimir Vapnik converted an SVM into an optimization problem that is much easier to solve. Nevertheless, existing papers do not explain how the optimization problem derives from an SVM well. These ambiguities may cause certain troubles in understanding the basic concepts of SVMs. For this purpose, this paper defines a separating hyperplane of a training data set and, hence, an optimal separating hyperplane of the set. The two definitions are reasonable since this paper proves that <span><math><mrow><msubsup><mrow><mtext>w</mtext></mrow><mrow><mn>0</mn></mrow><mrow><mtext>T</mtext></mrow></msubsup><mtext>x</mtext><mo>+</mo><msub><mrow><mi>b</mi></mrow><mrow><mn>0</mn></mrow></msub><mo>=</mo><mn>0</mn></mrow></math></span> is an optimal separating hyperplane of a training data set when <span><math><msub><mrow><mtext>w</mtext></mrow><mrow><mn>0</mn></mrow></msub></math></span> and <span><math><msub><mrow><mi>b</mi></mrow><mrow><mn>0</mn></mrow></msub></math></span> constitute a solution to the above optimization problem. Some notes on the above margin and optimization problem are given based on the two definitions. These notes should be meaningful for clarifying the basic concepts of SVMs.</p></div>","PeriodicalId":48907,"journal":{"name":"Journal of Computational Science","volume":"82 ","pages":"Article 102390"},"PeriodicalIF":3.1,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1877750324001832/pdfft?md5=e5c1cc2cfe92cdf160c7da2829fc6cb5&pid=1-s2.0-S1877750324001832-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141984969","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Modelling the queues of connected and autonomous vehicles at signal-free intersections considering the correlated vehicle arrivals","authors":"Qiaoli Yang, Jiaqi Zhang","doi":"10.1016/j.jocs.2024.102420","DOIUrl":"10.1016/j.jocs.2024.102420","url":null,"abstract":"<div><p>Advances in connected and autonomous vehicle (CAV) technologies have made signal-free intersections a viable option for enhancing traffic performance. In the absence of traffic signal control, sequencing control strategies become crucial to ensuring the safety and efficiency of conflicting traffic flows at these intersections. The First-Come-First-Serve (FCFS) and Longest-Queue-First (LQF) strategies have received significant attention as fundamental approaches to managing connected and automated vehicles at signal-free intersections, serving as baselines for evaluating innovative strategies. However, the impact of varying traffic demand in conflicting directions on the volatility of CAV queues at signal-free intersections remains unclear, and there is a lack of analytical quantitative estimates on how these two fundamental sequencing strategies affect fairness within CAV queues. Furthermore, in urban road networks, CAVs entering a downstream intersection typically originate from an upstream intersection, and thus CAVs typically move in bunching and correlation. However, this phenomenon has received little attention in the modelling of CAV queues. To this end, in this paper, by virtue of the salient advantage of the Markovian Arrival Process (MAP) in describing the bunching and correlated arrival properties, an MAP-based double-input queueing model and its computational framework are developed to estimate the queueing process of CAVs at signal-free intersections. Some basic statistical metrics, such as queue length, delay, conditional queue length, and queue length variance, are derived. Additionally, numerical experiments are conducted to examine the queueing performance of FCFS and LQF strategies under different traffic conditions. The results suggest that the effectiveness of FCFS and LQF strategies varies depending on the level of traffic demand in the conflicting directions.</p></div>","PeriodicalId":48907,"journal":{"name":"Journal of Computational Science","volume":"82 ","pages":"Article 102420"},"PeriodicalIF":3.1,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142129571","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Christian Tantardini , Hayk A. Zakaryan , Zhong-Kang Han , Tariq Altalhi , Sergey V. Levchenko , Alexander G. Kvashnin , Boris I. Yakobson
{"title":"Material hardness descriptor derived by symbolic regression","authors":"Christian Tantardini , Hayk A. Zakaryan , Zhong-Kang Han , Tariq Altalhi , Sergey V. Levchenko , Alexander G. Kvashnin , Boris I. Yakobson","doi":"10.1016/j.jocs.2024.102402","DOIUrl":"10.1016/j.jocs.2024.102402","url":null,"abstract":"<div><p>Hardness is a materials’ property with implications in several industrial fields, including oil and gas, manufacturing, and others. However, the relationship between this macroscale property and atomic (i.e., microscale) properties is unknown and in the last decade several models have unsuccessfully tried to correlate them in a wide range of chemical space. The understanding of such relationship is of fundamental importance for discovery of harder materials with specific characteristics to be employed in a wide range of fields. In this work, we have found a physical descriptor for Vickers hardness using a symbolic-regression artificial-intelligence approach based on compressed sensing. SISSO (Sure Independence Screening plus Sparsifying Operator) is an artificial-intelligence algorithm used for discovering simple and interpretable predictive models. It performs feature selection from up to billions of candidates obtained from several primary features by applying a set of mathematical operators. The resulting sparse SISSO model accurately describes the target property (i.e., Vickers hardness) with minimal complexity. We have considered the experimental values of hardness for binary, ternary, and quaternary transition-metal borides, carbides, nitrides, carbonitrides, carboborides, and boronitrides of 61 materials, on which the fitting was performed.. The found descriptor is a non-linear function of the microscopic properties, with the most significant contribution being from a combination of Voigt-averaged bulk modulus, Poisson’s ratio, and Reuss-averaged shear modulus. Results of high-throughput screening of 635 candidate materials using the found descriptor suggest the enhancement of material’s hardness through mixing with harder yet metastable structures (e.g., metastable VN, TaN, ReN<span><math><msub><mrow></mrow><mrow><mn>2</mn></mrow></msub></math></span>, Cr<span><math><msub><mrow></mrow><mrow><mn>3</mn></mrow></msub></math></span>N<span><math><msub><mrow></mrow><mrow><mn>4</mn></mrow></msub></math></span>, and ZrB<span><math><msub><mrow></mrow><mrow><mn>6</mn></mrow></msub></math></span> all exhibit high hardness).</p></div>","PeriodicalId":48907,"journal":{"name":"Journal of Computational Science","volume":"82 ","pages":"Article 102402"},"PeriodicalIF":3.1,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1877750324001959/pdfft?md5=01248d5bc6185e230ddb5469bb838119&pid=1-s2.0-S1877750324001959-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142136620","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Step-based checkpointing with high-level algorithmic differentiation","authors":"James R. Maddison","doi":"10.1016/j.jocs.2024.102405","DOIUrl":"10.1016/j.jocs.2024.102405","url":null,"abstract":"<div><p>Automated code generation allows for a separation between the development of a model, expressed via a domain specific language, and lower level implementation details. Algorithmic differentiation can be applied symbolically at the level of the domain specific language, and the code generator reused to implement code required for an adjoint calculation. However the adjoint calculations are complicated by the well-known problem of storing or recomputing the forward data required by the adjoint, and different checkpointing strategies have been developed to tackle this problem. This article considers the combination of high-level algorithmic differentiation with step-based checkpointing schedules, with the primary application being for solvers of time-dependent partial differential equations. The focus is on algorithmic differentiation using a dynamically constructed record of forward operations, where the precise structure of the original forward calculation is unknown ahead-of-time. In addition, high-level approaches provide a simplified view of the model itself. This allows data required to restart and advance the forward, and data required to advance the adjoint, to be identified. The difference between the two types of data is here leveraged to implement checkpointing strategies with improved performance.</p></div>","PeriodicalId":48907,"journal":{"name":"Journal of Computational Science","volume":"82 ","pages":"Article 102405"},"PeriodicalIF":3.1,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1877750324001984/pdfft?md5=6f935bc44600d9170907d962ee7163e7&pid=1-s2.0-S1877750324001984-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141939217","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Aayush Pandey , Jeevesh Mahajan , Srinag P. , Aditya Rastogi , Arnab Roy , Partha P. Chakrabarti
{"title":"DARSI: A deep auto-regressive time series inference architecture for forecasting of aerodynamic parameters","authors":"Aayush Pandey , Jeevesh Mahajan , Srinag P. , Aditya Rastogi , Arnab Roy , Partha P. Chakrabarti","doi":"10.1016/j.jocs.2024.102401","DOIUrl":"10.1016/j.jocs.2024.102401","url":null,"abstract":"<div><p>In the realm of fluid mechanics, where computationally-intensive simulations demand significant time investments, especially in predicting aerodynamic coefficients, the conventional use of time series forecasting techniques becomes paramount. Existing methods prove effective with periodic time series, yet the challenge escalates when faced with aperiodic or chaotic system responses. To address this challenge, we introduce DARSI (Deep Auto-Regressive Time Series Inference), an advanced architecture and an efficient hybrid of Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) components. Evaluated against established architectures (CNN, DLinear, LSTM, LSTNet, and PatchTST) for forecasting Coefficient of Lift (<span><math><msub><mrow><mi>C</mi></mrow><mrow><mi>L</mi></mrow></msub></math></span>) values corresponding to Angles of Attack (AoAs) across periodic, aperiodic, and chaotic regimes, DARSI demonstrates remarkable performance, showing an average increase of 79.95% in CORR, 76.57% reduction in MAPE, 94.70% reduction in MSE, 76.18% reduction in QL, and 75.21% reduction in RRSE. Particularly adept at predicting chaotic aerodynamic coefficients, DARSI emerges as the best in static scenarios, surpassing DLinear and providing heightened reliability. In dynamic scenarios, DLinear takes the lead, with DARSI securing the second position alongside PatchTST. Furthermore, static AoAs at 24.7 are identified as the most chaotic, surpassing those at 24.9 and the study reveals a potential inflection point at AoA 24.7 in static scenarios for both DLinear and DARSI, warranting further confirmation. This research positions DARSI as an adept alternative to simulations, offering computational efficiency with significant implications for diverse time series forecasting applications across industries, particularly in advancing aerodynamic predictions in chaotic scenarios.</p></div>","PeriodicalId":48907,"journal":{"name":"Journal of Computational Science","volume":"82 ","pages":"Article 102401"},"PeriodicalIF":3.1,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142020720","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Resonance modeling of the tsunami caused by the Aegean Sea Earthquake (Mw7.0) of October 30, 2020","authors":"Olcay Eğri̇boyun, Lale Balas","doi":"10.1016/j.jocs.2024.102398","DOIUrl":"10.1016/j.jocs.2024.102398","url":null,"abstract":"<div><p>The resonance of tsunami waves in semi-enclosed bays is paramount in understanding and mitigating the impact of seismic events on coastal communities. Semi-enclosed bays, characterized by their partial enclosure, can amplify the effects of incoming tsunami waves due to resonance behavior, where the natural frequencies of the bay correspond to those of the incoming waves. This resonance phenomenon can significantly increase wave height and inundation levels, posing an increased risk to nearby settlements and infrastructure. Understanding the resonance patterns in these bays is crucial for accurate hazard assessment, early warning systems, and effective disaster preparedness and response strategies. On October 30, 2020, an earthquake occurred between the Turkish Bay of Seferihisar Bay and the Greek island of Samos in the Aegean Sea. Long waves generated by the normal-faulting earthquake caused notable damage to settlements within Seferihisar Bay and the north coast of Samos Island. According to the measurements of the Syros mareograph stations, the wave heights were between 2 and 20 cm and wave periods between 9 and 20 seconds. Based on on-site survey reports conducted after the earthquake, inundation was reported in six settlements within Seferihisar Bay. However, inundation was notably higher in Sığacık and Akarca, reaching 2–3 times higher than in other locations, and the water level reached 2 m high. Given that the variance in inundation levels is attributed to resonance phenomena in Sığacık and Akarca rather than the propagation of tsunami waves, this study focused on conducting wave resonance modeling in Seferihisar Bay. The resonance modeling was performed using the RIDE wave model. Furthermore, the research has been expanded to assess the resonance patterns that might emerge in the event of an alternative earthquake or underwater landslide along the fault line responsible for the seismic event, encompassing wave periods ranging from T = 1–9 minutes and T = 20–30 minutes. Modeling results revealed that on the day of the earthquake, wave heights in Sığacık Marina and Akarca surged by 8.5 times in comparison to the wave height at the epicenter. This increase is notably higher, ranging from 2 to 2.5 times, compared to calculations made for other locations (Demircili, Altınköy, and Tepecik). Consequently, it was concluded that one of the reasons for the heightened effectiveness of inundation in Sığacık and Akarca was attributable to resonance. Moreover, supplementary investigations have indicated that waves with a period of T<9 minutes will pose higher risks for Demircili, Altınköy, Sığacık Marina, and Tepecik compared to the day of the earthquake. By comprehensively studying wave resonance in semi-enclosed bays, researchers and policymakers can better anticipate the potential impact of tsunami events and take measures to protect coastal communities, ultimately increasing resilience and reducing the loss of life and property in vulner","PeriodicalId":48907,"journal":{"name":"Journal of Computational Science","volume":"82 ","pages":"Article 102398"},"PeriodicalIF":3.1,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141939268","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Exploring the integration of IoT and Generative AI in English language education: Smart tools for personalized learning experiences","authors":"Wanjin Dong , Daohua Pan , Soonbae Kim","doi":"10.1016/j.jocs.2024.102397","DOIUrl":"10.1016/j.jocs.2024.102397","url":null,"abstract":"<div><p>English language education is undergoing a transformative shift, propelled by advancements in technology. This research explores the integration of the Internet of Things (IoT) and Generative Artificial Intelligence (Generative AI) in the context of English language education, with a focus on developing a personalized oral assessment method. The proposed method leverages real-time data collection from IoT devices and Generative AI's language generation capabilities to create a dynamic and adaptive learning environment. The study addresses historical challenges in traditional teaching methodologies, emphasizing the need for AI approaches. The research objectives encompass a comprehensive exploration of the historical context, challenges, and existing technological interventions in English language education. A novel, technology-driven oral assessment method is designed, implemented, and rigorously evaluated using datasets such as Librispeech and L2Arctic. The ablation study investigates the impact of training dataset proportions and model learning rates on the method's performance. Results from the study highlight the importance of maintaining a balance in dataset proportions, selecting an optimal learning rate, and considering model depth in achieving optimal performance.</p></div>","PeriodicalId":48907,"journal":{"name":"Journal of Computational Science","volume":"82 ","pages":"Article 102397"},"PeriodicalIF":3.1,"publicationDate":"2024-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141998444","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A. Inés, C. Domínguez, J. Heras, G. Mata, J. Rubio
{"title":"A topological approach for semi-supervised learning","authors":"A. Inés, C. Domínguez, J. Heras, G. Mata, J. Rubio","doi":"10.1016/j.jocs.2024.102403","DOIUrl":"10.1016/j.jocs.2024.102403","url":null,"abstract":"<div><p>Nowadays, Machine Learning and Deep Learning methods have become the state-of-the-art approach to solve data classification tasks. In order to use those methods, it is necessary to acquire and label a considerable amount of data; however, this is not straightforward in some fields, since data annotation is time consuming and might require expert knowledge. This challenge can be tackled by means of semi-supervised learning methods that take advantage of both labelled and unlabelled data. In this work, we present new semi-supervised learning methods based on techniques from Topological Data Analysis (TDA). In particular, we have created two semi-supervised learning methods following two topological approaches. In the former, we have used a homological approach that consists in studying the persistence diagrams associated with the data using the bottleneck and Wasserstein distances. In the latter, we have considered the connectivity of the data. In addition, we have carried out a thorough analysis of the developed methods using 9 tabular datasets with low and high dimensionality. The results show that the developed semi-supervised methods outperform the results obtained with models trained with only manually labelled data, and are an alternative to other classical semi-supervised learning algorithms.</p></div>","PeriodicalId":48907,"journal":{"name":"Journal of Computational Science","volume":"82 ","pages":"Article 102403"},"PeriodicalIF":3.1,"publicationDate":"2024-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141984968","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}