Classification of Visit-to-Visit Blood Pressure Variability: A Machine Learning Approach for Data Clustering on Systolic Blood Pressure Intervention Trial (SPRINT)
K. Tsoi, Max W. Y. Lam, Felix C. H. Chan, H. W. Hirai, Baker K. K. Bat, Samuel Y. S. Wong, H. Meng
{"title":"Classification of Visit-to-Visit Blood Pressure Variability: A Machine Learning Approach for Data Clustering on Systolic Blood Pressure Intervention Trial (SPRINT)","authors":"K. Tsoi, Max W. Y. Lam, Felix C. H. Chan, H. W. Hirai, Baker K. K. Bat, Samuel Y. S. Wong, H. Meng","doi":"10.1145/3079452.3079454","DOIUrl":null,"url":null,"abstract":"Background: Blood pressure variability (BPV) is associated with the cardiovascular disease. However, there is no standard risk stratification method to evaluate BPV. Our study aims to cluster BPV into three levels, namely, low, medium and high levels, by a machine learning approach. Methods: The Systolic Blood Pressure Intervention Trial (SPRINT) dataset, which includes patients with hypertension or at risk of cardiovascular diseases, was obtained from a clinical data sharing platform. In the clinical trial, participants with systolic blood pressure (SBP) of at least 130 mmHg and an increased cardiovascular risk were randomized to receive intensive treatment (targeting SBP below 120 mmHg) or standard treatment (targeting SBP below 140 mmHg), and blood pressure (BP) were measured and recorded during the follow-up periods. Visit-to-visit BPV was measured by the deviation between the observed records and the personalized BP trends, and two-dimensional clustering on SBP and diastolic BP were applied. Different curve fitting techniques (linear regression and cubic regression) and clustering methods (K-means and Agglomerative Clustering) were attempted and compared with each other. Results: With 8,092 participants and a median follow-up of 3.26 years, linear regression was a simple and reliable method to capture the BP trend. K-means model showed stable data clustering results. Intensive treatment showed to be effective for participants with a high level of BPV. Conclusion: Machine learning can be used for data clustering on BPV.","PeriodicalId":245682,"journal":{"name":"Proceedings of the 2017 International Conference on Digital Health","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2017 International Conference on Digital Health","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3079452.3079454","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Blood pressure variability (BPV) is associated with the cardiovascular disease. However, there is no standard risk stratification method to evaluate BPV. Our study aims to cluster BPV into three levels, namely, low, medium and high levels, by a machine learning approach. Methods: The Systolic Blood Pressure Intervention Trial (SPRINT) dataset, which includes patients with hypertension or at risk of cardiovascular diseases, was obtained from a clinical data sharing platform. In the clinical trial, participants with systolic blood pressure (SBP) of at least 130 mmHg and an increased cardiovascular risk were randomized to receive intensive treatment (targeting SBP below 120 mmHg) or standard treatment (targeting SBP below 140 mmHg), and blood pressure (BP) were measured and recorded during the follow-up periods. Visit-to-visit BPV was measured by the deviation between the observed records and the personalized BP trends, and two-dimensional clustering on SBP and diastolic BP were applied. Different curve fitting techniques (linear regression and cubic regression) and clustering methods (K-means and Agglomerative Clustering) were attempted and compared with each other. Results: With 8,092 participants and a median follow-up of 3.26 years, linear regression was a simple and reliable method to capture the BP trend. K-means model showed stable data clustering results. Intensive treatment showed to be effective for participants with a high level of BPV. Conclusion: Machine learning can be used for data clustering on BPV.