Manuel Tran, Paul Schmidle, Sophia J. Wagner, Valentin Koch, Valerio Lupperger, Annette Feuchtinger, Alexander Boehner, Robert Kaczmarczyk, Tilo Biedermann, Kilian Eyerich, Stephan A. Braun, Tingying Peng, Carsten Marr
{"title":"利用 HistoGPT 从千兆像素全切片图像生成高度准确的病理报告","authors":"Manuel Tran, Paul Schmidle, Sophia J. Wagner, Valentin Koch, Valerio Lupperger, Annette Feuchtinger, Alexander Boehner, Robert Kaczmarczyk, Tilo Biedermann, Kilian Eyerich, Stephan A. Braun, Tingying Peng, Carsten Marr","doi":"10.1101/2024.03.15.24304211","DOIUrl":null,"url":null,"abstract":"Histopathology is considered the gold standard for determining the presence and nature of disease, particularly cancer. However, the process of analyzing tissue samples and producing a final pathology report is time-consuming, labor-intensive, and non-standardized. Therefore, new technological solutions are being sought to reduce the workload of pathologists. In this work, we present HistoGPT, a vision language model that takes digitized slides as input and generates reports that match the quality of human-written reports, as confirmed by natural language processing metrics and domain expert evaluations. We show that HistoGPT generalizes to five international cohorts and can predict tumor subtypes and tumor thickness in a zero-shot fashion. Our work represents an important step toward integrating AI into the medical workflow. We publish both model code and weights so that the scientific community can apply and improve HistoGPT to advance the field of computational pathology.","PeriodicalId":501528,"journal":{"name":"medRxiv - Pathology","volume":"31 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Generating highly accurate pathology reports from gigapixel whole slide images with HistoGPT\",\"authors\":\"Manuel Tran, Paul Schmidle, Sophia J. Wagner, Valentin Koch, Valerio Lupperger, Annette Feuchtinger, Alexander Boehner, Robert Kaczmarczyk, Tilo Biedermann, Kilian Eyerich, Stephan A. Braun, Tingying Peng, Carsten Marr\",\"doi\":\"10.1101/2024.03.15.24304211\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Histopathology is considered the gold standard for determining the presence and nature of disease, particularly cancer. However, the process of analyzing tissue samples and producing a final pathology report is time-consuming, labor-intensive, and non-standardized. Therefore, new technological solutions are being sought to reduce the workload of pathologists. In this work, we present HistoGPT, a vision language model that takes digitized slides as input and generates reports that match the quality of human-written reports, as confirmed by natural language processing metrics and domain expert evaluations. We show that HistoGPT generalizes to five international cohorts and can predict tumor subtypes and tumor thickness in a zero-shot fashion. Our work represents an important step toward integrating AI into the medical workflow. We publish both model code and weights so that the scientific community can apply and improve HistoGPT to advance the field of computational pathology.\",\"PeriodicalId\":501528,\"journal\":{\"name\":\"medRxiv - Pathology\",\"volume\":\"31 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-03-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"medRxiv - Pathology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1101/2024.03.15.24304211\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv - Pathology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.03.15.24304211","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Generating highly accurate pathology reports from gigapixel whole slide images with HistoGPT
Histopathology is considered the gold standard for determining the presence and nature of disease, particularly cancer. However, the process of analyzing tissue samples and producing a final pathology report is time-consuming, labor-intensive, and non-standardized. Therefore, new technological solutions are being sought to reduce the workload of pathologists. In this work, we present HistoGPT, a vision language model that takes digitized slides as input and generates reports that match the quality of human-written reports, as confirmed by natural language processing metrics and domain expert evaluations. We show that HistoGPT generalizes to five international cohorts and can predict tumor subtypes and tumor thickness in a zero-shot fashion. Our work represents an important step toward integrating AI into the medical workflow. We publish both model code and weights so that the scientific community can apply and improve HistoGPT to advance the field of computational pathology.