Fumin Wang , Nan Zhang , Xiaoning Wu , Wei Zhang , Qiang Lu , Rongqian Wu , Xu-Feng Zhang , Hui Guo , Yi Lv
{"title":"基于临床缺失数据集独立支持向量机的肝癌多特征权重因子提取及生存风险评估","authors":"Fumin Wang , Nan Zhang , Xiaoning Wu , Wei Zhang , Qiang Lu , Rongqian Wu , Xu-Feng Zhang , Hui Guo , Yi Lv","doi":"10.1016/j.iliver.2022.07.003","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><p>In clinical datasets, the characteristics of an individual patient vary so much that data loss becomes a normal event, which may be a unignorable dilemma in clinical data analysis. Therefore, the construction of a machine learning model aimed at missing clinical datasets (MCD) is of great clinical importance.</p></div><div><h3>Methods</h3><p>All included patients were divided into two groups according to outcome within a period of up to 36 months or less. The following characteristics (variables) were collected: age, sex, Child–Pugh status, hepatitis status, cirrhosis status, treatment, tumor size, portal vein tumor thrombus, and alpha fetoprotein (μg/mL), and a missing dataset-independent support vector machine (MDI-SVM) independent of missing data was built for the analysis.</p></div><div><h3>Results</h3><p>A MCD-independent SVM was developed based on clinical data from 1334 patients with hepatocellular carcinoma (HCC) at a single center, which had an accuracy of 84.43% in the survival analysis in the presence of 5% missing data. Based on the different combinations of features, our model calculated five features (tumor size, age, treatment, hepatitis status, and alpha fetoprotein) that had the greatest impact on survival in patients with HCC and extracted their weighting factors.</p></div><div><h3>Conclusions</h3><p>A MCD-independent SVM was developed to achieve prognosis prediction for patients with HCC in the absence of first-visit data.</p></div>","PeriodicalId":100657,"journal":{"name":"iLIVER","volume":"1 3","pages":"Pages 154-158"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772947822000408/pdfft?md5=8b358f4a4124596b0f47c2d8785d3028&pid=1-s2.0-S2772947822000408-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Multi-feature weight factor extraction and survival risk assessment of hepatocellular carcinoma based on a clinical missing dataset-independent support vector machine\",\"authors\":\"Fumin Wang , Nan Zhang , Xiaoning Wu , Wei Zhang , Qiang Lu , Rongqian Wu , Xu-Feng Zhang , Hui Guo , Yi Lv\",\"doi\":\"10.1016/j.iliver.2022.07.003\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><p>In clinical datasets, the characteristics of an individual patient vary so much that data loss becomes a normal event, which may be a unignorable dilemma in clinical data analysis. Therefore, the construction of a machine learning model aimed at missing clinical datasets (MCD) is of great clinical importance.</p></div><div><h3>Methods</h3><p>All included patients were divided into two groups according to outcome within a period of up to 36 months or less. The following characteristics (variables) were collected: age, sex, Child–Pugh status, hepatitis status, cirrhosis status, treatment, tumor size, portal vein tumor thrombus, and alpha fetoprotein (μg/mL), and a missing dataset-independent support vector machine (MDI-SVM) independent of missing data was built for the analysis.</p></div><div><h3>Results</h3><p>A MCD-independent SVM was developed based on clinical data from 1334 patients with hepatocellular carcinoma (HCC) at a single center, which had an accuracy of 84.43% in the survival analysis in the presence of 5% missing data. Based on the different combinations of features, our model calculated five features (tumor size, age, treatment, hepatitis status, and alpha fetoprotein) that had the greatest impact on survival in patients with HCC and extracted their weighting factors.</p></div><div><h3>Conclusions</h3><p>A MCD-independent SVM was developed to achieve prognosis prediction for patients with HCC in the absence of first-visit data.</p></div>\",\"PeriodicalId\":100657,\"journal\":{\"name\":\"iLIVER\",\"volume\":\"1 3\",\"pages\":\"Pages 154-158\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2772947822000408/pdfft?md5=8b358f4a4124596b0f47c2d8785d3028&pid=1-s2.0-S2772947822000408-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"iLIVER\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2772947822000408\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"iLIVER","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772947822000408","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Multi-feature weight factor extraction and survival risk assessment of hepatocellular carcinoma based on a clinical missing dataset-independent support vector machine
Background
In clinical datasets, the characteristics of an individual patient vary so much that data loss becomes a normal event, which may be a unignorable dilemma in clinical data analysis. Therefore, the construction of a machine learning model aimed at missing clinical datasets (MCD) is of great clinical importance.
Methods
All included patients were divided into two groups according to outcome within a period of up to 36 months or less. The following characteristics (variables) were collected: age, sex, Child–Pugh status, hepatitis status, cirrhosis status, treatment, tumor size, portal vein tumor thrombus, and alpha fetoprotein (μg/mL), and a missing dataset-independent support vector machine (MDI-SVM) independent of missing data was built for the analysis.
Results
A MCD-independent SVM was developed based on clinical data from 1334 patients with hepatocellular carcinoma (HCC) at a single center, which had an accuracy of 84.43% in the survival analysis in the presence of 5% missing data. Based on the different combinations of features, our model calculated five features (tumor size, age, treatment, hepatitis status, and alpha fetoprotein) that had the greatest impact on survival in patients with HCC and extracted their weighting factors.
Conclusions
A MCD-independent SVM was developed to achieve prognosis prediction for patients with HCC in the absence of first-visit data.