{"title":"从数字化全切片组织学图像到生物标志物发现:脑癌病理中手工特征分析的协议","authors":"Xuanjun Lu, Yawen Ying, Jing Chen, Zhiyang Chen, Yuxin Wu, Prateek Prasanna, Xin Chen, Mingli Jing, Zaiyi Liu, Cheng Lu","doi":"10.1002/brx2.70030","DOIUrl":null,"url":null,"abstract":"<p>Hematoxylin and eosin (H&E)-stained histopathological slides contain abundant information about cellular and tissue morphology and have been the cornerstone of tumor diagnosis for decades. In recent years, advancements in digital pathology have made whole-slide images (WSIs) widely applicable for diagnosis, prognosis, and prediction in brain cancer. However, there remains a lack of systematic tools and standardized protocols for using handcrafted features in brain cancer histological analysis. In this study, we present a protocol for handcrafted feature analysis in brain cancer pathology (PHBCP) to systematically extract, analyze, model, and visualize handcrafted features from WSIs. The protocol enabled the discovery of biomarkers from WSIs through a series of well-defined steps. The PHBCP comprises seven main steps: (1) problem definition, (2) data quality control, (3) image preprocessing, (4) feature extraction, (5) feature filtering, (6) modeling, and (7) performance analysis. As an exemplary application, we collected pathological data of 589 patients from two cohorts and applied the PHBCP to predict the 2-year survival of glioblastoma multiforme (GBM) patients. Among the 72 models combining nine feature selection methods and eight machine learning classifiers, the optimal model combination achieved discriminative performance with an average area under the curve (AUC) of 0.615 over 100 iterations under five-fold cross-validation. In the external validation cohort, the optimal model combination achieved a generalization performance with an AUC of 0.594. We provide an open-source code repository (GitHub website: https://github.com/XuanjunLu/PHBCP) to facilitate effective collaboration between medical and technical experts, thereby advancing the field of computational pathology in brain cancer.</p>","PeriodicalId":94303,"journal":{"name":"Brain-X","volume":"3 2","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/brx2.70030","citationCount":"0","resultStr":"{\"title\":\"From digitized whole-slide histology images to biomarker discovery: A protocol for handcrafted feature analysis in brain cancer pathology\",\"authors\":\"Xuanjun Lu, Yawen Ying, Jing Chen, Zhiyang Chen, Yuxin Wu, Prateek Prasanna, Xin Chen, Mingli Jing, Zaiyi Liu, Cheng Lu\",\"doi\":\"10.1002/brx2.70030\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Hematoxylin and eosin (H&E)-stained histopathological slides contain abundant information about cellular and tissue morphology and have been the cornerstone of tumor diagnosis for decades. In recent years, advancements in digital pathology have made whole-slide images (WSIs) widely applicable for diagnosis, prognosis, and prediction in brain cancer. However, there remains a lack of systematic tools and standardized protocols for using handcrafted features in brain cancer histological analysis. In this study, we present a protocol for handcrafted feature analysis in brain cancer pathology (PHBCP) to systematically extract, analyze, model, and visualize handcrafted features from WSIs. The protocol enabled the discovery of biomarkers from WSIs through a series of well-defined steps. The PHBCP comprises seven main steps: (1) problem definition, (2) data quality control, (3) image preprocessing, (4) feature extraction, (5) feature filtering, (6) modeling, and (7) performance analysis. As an exemplary application, we collected pathological data of 589 patients from two cohorts and applied the PHBCP to predict the 2-year survival of glioblastoma multiforme (GBM) patients. Among the 72 models combining nine feature selection methods and eight machine learning classifiers, the optimal model combination achieved discriminative performance with an average area under the curve (AUC) of 0.615 over 100 iterations under five-fold cross-validation. In the external validation cohort, the optimal model combination achieved a generalization performance with an AUC of 0.594. We provide an open-source code repository (GitHub website: https://github.com/XuanjunLu/PHBCP) to facilitate effective collaboration between medical and technical experts, thereby advancing the field of computational pathology in brain cancer.</p>\",\"PeriodicalId\":94303,\"journal\":{\"name\":\"Brain-X\",\"volume\":\"3 2\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-05-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1002/brx2.70030\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Brain-X\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/brx2.70030\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Brain-X","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/brx2.70030","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
From digitized whole-slide histology images to biomarker discovery: A protocol for handcrafted feature analysis in brain cancer pathology
Hematoxylin and eosin (H&E)-stained histopathological slides contain abundant information about cellular and tissue morphology and have been the cornerstone of tumor diagnosis for decades. In recent years, advancements in digital pathology have made whole-slide images (WSIs) widely applicable for diagnosis, prognosis, and prediction in brain cancer. However, there remains a lack of systematic tools and standardized protocols for using handcrafted features in brain cancer histological analysis. In this study, we present a protocol for handcrafted feature analysis in brain cancer pathology (PHBCP) to systematically extract, analyze, model, and visualize handcrafted features from WSIs. The protocol enabled the discovery of biomarkers from WSIs through a series of well-defined steps. The PHBCP comprises seven main steps: (1) problem definition, (2) data quality control, (3) image preprocessing, (4) feature extraction, (5) feature filtering, (6) modeling, and (7) performance analysis. As an exemplary application, we collected pathological data of 589 patients from two cohorts and applied the PHBCP to predict the 2-year survival of glioblastoma multiforme (GBM) patients. Among the 72 models combining nine feature selection methods and eight machine learning classifiers, the optimal model combination achieved discriminative performance with an average area under the curve (AUC) of 0.615 over 100 iterations under five-fold cross-validation. In the external validation cohort, the optimal model combination achieved a generalization performance with an AUC of 0.594. We provide an open-source code repository (GitHub website: https://github.com/XuanjunLu/PHBCP) to facilitate effective collaboration between medical and technical experts, thereby advancing the field of computational pathology in brain cancer.