{"title":"血清蛋白电泳数据解释的监督机器学习模型。","authors":"Yee-Ting Cheung, Hoi-Shan Leung, Jeremiah Sik-Bit Tseung, Kelvin Yat-Chung Yu, Mei-Tik Stella Leung, Chor-Kwan Ching, Yeow-Kuan Chong","doi":"10.1016/j.pathol.2025.05.010","DOIUrl":null,"url":null,"abstract":"<p><p>Serum protein electrophoresis (SPE) is a frequently employed laboratory test in clinical settings, with over 10,000 requests per annum in our centre. It is primarily utilised for the diagnosis and monitoring of paraproteinaemia. Interpretation on SPE is time-consuming and relies on the expertise of pathologists, with potential interobserver variability. Assistance from machine learning algorithms could improve efficiency and objectiveness. Digitised capillary electrophoresis (CE) tracings acquired using the Sebia Minicap Protein(E) 6 kit were extracted from the analyser, and corresponding reports for SPE were obtained from the laboratory information systems of Princess Margaret Hospital (PMH) and Tuen Mun Hospital (TMH). Three artificial neural networks (for fractionation, classification and location plus quantification) were trained and evaluated against reference interpretations from one to two pathologists. Samples from PMH constitute the training datasets. Trained models were subsequently evaluated with samples from TMH. A total of 41,448 and 24,501 CE tracings and corresponding reports for SPE, spanning from October 2014 to November 2022, were obtained from PMH and TMH, respectively; 25,661-41,014 samples from PMH constituted the training datasets. Trained models were subsequently evaluated with 24,238 samples from TMH. The classification model achieved an area under the receiver operating characteristic curve of 0.976 in the testing dataset, with an agreement rate of 93.8%. The fractionation model had mean and standard deviation difference from reported manual fractioning of -0.0884 to 0.155 g/L and 0.315 to 2.04 g/L, respectively, across the six serum protein bands. Peak quantification by the location plus quantification model correlated with manual quantification, with Spearman's r of 0.976. The machine learning models achieved near-human performances. They enabled high-throughput SPE analyses and interpretation and improved objectiveness and reproducibility of results.</p>","PeriodicalId":19915,"journal":{"name":"Pathology","volume":" ","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2025-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Supervised machine learning model for serum protein electrophoresis data interpretation.\",\"authors\":\"Yee-Ting Cheung, Hoi-Shan Leung, Jeremiah Sik-Bit Tseung, Kelvin Yat-Chung Yu, Mei-Tik Stella Leung, Chor-Kwan Ching, Yeow-Kuan Chong\",\"doi\":\"10.1016/j.pathol.2025.05.010\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Serum protein electrophoresis (SPE) is a frequently employed laboratory test in clinical settings, with over 10,000 requests per annum in our centre. It is primarily utilised for the diagnosis and monitoring of paraproteinaemia. Interpretation on SPE is time-consuming and relies on the expertise of pathologists, with potential interobserver variability. Assistance from machine learning algorithms could improve efficiency and objectiveness. Digitised capillary electrophoresis (CE) tracings acquired using the Sebia Minicap Protein(E) 6 kit were extracted from the analyser, and corresponding reports for SPE were obtained from the laboratory information systems of Princess Margaret Hospital (PMH) and Tuen Mun Hospital (TMH). Three artificial neural networks (for fractionation, classification and location plus quantification) were trained and evaluated against reference interpretations from one to two pathologists. Samples from PMH constitute the training datasets. Trained models were subsequently evaluated with samples from TMH. A total of 41,448 and 24,501 CE tracings and corresponding reports for SPE, spanning from October 2014 to November 2022, were obtained from PMH and TMH, respectively; 25,661-41,014 samples from PMH constituted the training datasets. Trained models were subsequently evaluated with 24,238 samples from TMH. The classification model achieved an area under the receiver operating characteristic curve of 0.976 in the testing dataset, with an agreement rate of 93.8%. The fractionation model had mean and standard deviation difference from reported manual fractioning of -0.0884 to 0.155 g/L and 0.315 to 2.04 g/L, respectively, across the six serum protein bands. Peak quantification by the location plus quantification model correlated with manual quantification, with Spearman's r of 0.976. The machine learning models achieved near-human performances. They enabled high-throughput SPE analyses and interpretation and improved objectiveness and reproducibility of results.</p>\",\"PeriodicalId\":19915,\"journal\":{\"name\":\"Pathology\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2025-07-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pathology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1016/j.pathol.2025.05.010\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"PATHOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pathology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.pathol.2025.05.010","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PATHOLOGY","Score":null,"Total":0}
Supervised machine learning model for serum protein electrophoresis data interpretation.
Serum protein electrophoresis (SPE) is a frequently employed laboratory test in clinical settings, with over 10,000 requests per annum in our centre. It is primarily utilised for the diagnosis and monitoring of paraproteinaemia. Interpretation on SPE is time-consuming and relies on the expertise of pathologists, with potential interobserver variability. Assistance from machine learning algorithms could improve efficiency and objectiveness. Digitised capillary electrophoresis (CE) tracings acquired using the Sebia Minicap Protein(E) 6 kit were extracted from the analyser, and corresponding reports for SPE were obtained from the laboratory information systems of Princess Margaret Hospital (PMH) and Tuen Mun Hospital (TMH). Three artificial neural networks (for fractionation, classification and location plus quantification) were trained and evaluated against reference interpretations from one to two pathologists. Samples from PMH constitute the training datasets. Trained models were subsequently evaluated with samples from TMH. A total of 41,448 and 24,501 CE tracings and corresponding reports for SPE, spanning from October 2014 to November 2022, were obtained from PMH and TMH, respectively; 25,661-41,014 samples from PMH constituted the training datasets. Trained models were subsequently evaluated with 24,238 samples from TMH. The classification model achieved an area under the receiver operating characteristic curve of 0.976 in the testing dataset, with an agreement rate of 93.8%. The fractionation model had mean and standard deviation difference from reported manual fractioning of -0.0884 to 0.155 g/L and 0.315 to 2.04 g/L, respectively, across the six serum protein bands. Peak quantification by the location plus quantification model correlated with manual quantification, with Spearman's r of 0.976. The machine learning models achieved near-human performances. They enabled high-throughput SPE analyses and interpretation and improved objectiveness and reproducibility of results.
期刊介绍:
Published by Elsevier from 2016
Pathology is the official journal of the Royal College of Pathologists of Australasia (RCPA). It is committed to publishing peer-reviewed, original articles related to the science of pathology in its broadest sense, including anatomical pathology, chemical pathology and biochemistry, cytopathology, experimental pathology, forensic pathology and morbid anatomy, genetics, haematology, immunology and immunopathology, microbiology and molecular pathology.