{"title":"核心精度:基于物联网的垂直联合学习方法,用于异构数据驱动的心血管疾病风险预测。","authors":"Sulfikar Shajimon , Raj Mani Shukla , Amar Nath Patra","doi":"10.1016/j.cmpb.2025.109079","DOIUrl":null,"url":null,"abstract":"<div><h3>Background:</h3><div>Cardiovascular disease (CVD) seriously threatens individual health, highlighting the importance of early detection and proactive mitigation. With advances in consumer electronics such as wearables and IoT, an opportunity exists to enhance CVD prediction for users. Machine learning (ML) has been widely used for predicting CVD risk (high/low) based on various factors and is a critical area of healthcare research. However, sharing data needed to predict CVD with machine learning models is challenging due to privacy concerns. Federated learning (FL) enables distributed training of ML models without sharing raw data. However, it requires that all training features be available to all clients.</div></div><div><h3>Methods:</h3><div>To address this problem, we propose a method based on vertical federated learning (VFL) designed for consumer electronics platforms. The proposed method trains the Neural Network (NN) model in a distributed manner in which different parties hold different data features. In this work, each party maintains a portion of separate data features, performs calculations on them locally, and then transfers only the necessary information to train a NN model jointly. We employ the proposed method for different use cases where the dataset features are distributed between: (i) the patient and the hospital (2-splits); (ii) the patient, the doctor, and the laboratory (3-splits); and (iii) the patient, the doctor, the Electrocardiogram (ECG) center, and the laboratory (4-splits).</div></div><div><h3>Results:</h3><div>Using a realistic dataset publicly available, we test the proposed methodology, which gives around 90% accuracy, precision, recall, and f-score. It also does not need clients to possess the same features as compared to traditional Federated Learning.</div></div><div><h3>Conclusion:</h3><div>This paper has an impact on the healthcare sector, where user data privacy is of utmost concern. This advancement will improve the ability of the healthcare sector to diagnose various diseases and contribute to the field of AI by improving distributed AI algorithms.</div></div>","PeriodicalId":10624,"journal":{"name":"Computer methods and programs in biomedicine","volume":"273 ","pages":"Article 109079"},"PeriodicalIF":4.8000,"publicationDate":"2025-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Precision at heart: An IoT-based vertical federated learning approach for heterogeneous data-driven cardiovascular disease risk prediction\",\"authors\":\"Sulfikar Shajimon , Raj Mani Shukla , Amar Nath Patra\",\"doi\":\"10.1016/j.cmpb.2025.109079\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background:</h3><div>Cardiovascular disease (CVD) seriously threatens individual health, highlighting the importance of early detection and proactive mitigation. With advances in consumer electronics such as wearables and IoT, an opportunity exists to enhance CVD prediction for users. Machine learning (ML) has been widely used for predicting CVD risk (high/low) based on various factors and is a critical area of healthcare research. However, sharing data needed to predict CVD with machine learning models is challenging due to privacy concerns. Federated learning (FL) enables distributed training of ML models without sharing raw data. However, it requires that all training features be available to all clients.</div></div><div><h3>Methods:</h3><div>To address this problem, we propose a method based on vertical federated learning (VFL) designed for consumer electronics platforms. The proposed method trains the Neural Network (NN) model in a distributed manner in which different parties hold different data features. In this work, each party maintains a portion of separate data features, performs calculations on them locally, and then transfers only the necessary information to train a NN model jointly. We employ the proposed method for different use cases where the dataset features are distributed between: (i) the patient and the hospital (2-splits); (ii) the patient, the doctor, and the laboratory (3-splits); and (iii) the patient, the doctor, the Electrocardiogram (ECG) center, and the laboratory (4-splits).</div></div><div><h3>Results:</h3><div>Using a realistic dataset publicly available, we test the proposed methodology, which gives around 90% accuracy, precision, recall, and f-score. It also does not need clients to possess the same features as compared to traditional Federated Learning.</div></div><div><h3>Conclusion:</h3><div>This paper has an impact on the healthcare sector, where user data privacy is of utmost concern. This advancement will improve the ability of the healthcare sector to diagnose various diseases and contribute to the field of AI by improving distributed AI algorithms.</div></div>\",\"PeriodicalId\":10624,\"journal\":{\"name\":\"Computer methods and programs in biomedicine\",\"volume\":\"273 \",\"pages\":\"Article 109079\"},\"PeriodicalIF\":4.8000,\"publicationDate\":\"2025-10-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer methods and programs in biomedicine\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0169260725004961\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer methods and programs in biomedicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169260725004961","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
Precision at heart: An IoT-based vertical federated learning approach for heterogeneous data-driven cardiovascular disease risk prediction
Background:
Cardiovascular disease (CVD) seriously threatens individual health, highlighting the importance of early detection and proactive mitigation. With advances in consumer electronics such as wearables and IoT, an opportunity exists to enhance CVD prediction for users. Machine learning (ML) has been widely used for predicting CVD risk (high/low) based on various factors and is a critical area of healthcare research. However, sharing data needed to predict CVD with machine learning models is challenging due to privacy concerns. Federated learning (FL) enables distributed training of ML models without sharing raw data. However, it requires that all training features be available to all clients.
Methods:
To address this problem, we propose a method based on vertical federated learning (VFL) designed for consumer electronics platforms. The proposed method trains the Neural Network (NN) model in a distributed manner in which different parties hold different data features. In this work, each party maintains a portion of separate data features, performs calculations on them locally, and then transfers only the necessary information to train a NN model jointly. We employ the proposed method for different use cases where the dataset features are distributed between: (i) the patient and the hospital (2-splits); (ii) the patient, the doctor, and the laboratory (3-splits); and (iii) the patient, the doctor, the Electrocardiogram (ECG) center, and the laboratory (4-splits).
Results:
Using a realistic dataset publicly available, we test the proposed methodology, which gives around 90% accuracy, precision, recall, and f-score. It also does not need clients to possess the same features as compared to traditional Federated Learning.
Conclusion:
This paper has an impact on the healthcare sector, where user data privacy is of utmost concern. This advancement will improve the ability of the healthcare sector to diagnose various diseases and contribute to the field of AI by improving distributed AI algorithms.
期刊介绍:
To encourage the development of formal computing methods, and their application in biomedical research and medical practice, by illustration of fundamental principles in biomedical informatics research; to stimulate basic research into application software design; to report the state of research of biomedical information processing projects; to report new computer methodologies applied in biomedical areas; the eventual distribution of demonstrable software to avoid duplication of effort; to provide a forum for discussion and improvement of existing software; to optimize contact between national organizations and regional user groups by promoting an international exchange of information on formal methods, standards and software in biomedicine.
Computer Methods and Programs in Biomedicine covers computing methodology and software systems derived from computing science for implementation in all aspects of biomedical research and medical practice. It is designed to serve: biochemists; biologists; geneticists; immunologists; neuroscientists; pharmacologists; toxicologists; clinicians; epidemiologists; psychiatrists; psychologists; cardiologists; chemists; (radio)physicists; computer scientists; programmers and systems analysts; biomedical, clinical, electrical and other engineers; teachers of medical informatics and users of educational software.