{"title":"从肿瘤病理报告中提取肿瘤部位的逆回归","authors":"A. Dubey, Hong-Jun Yoon, G. Tourassi","doi":"10.1109/BHI.2019.8834527","DOIUrl":null,"url":null,"abstract":"Pathology reports are the primary source of information for cancer diagnosis of millions of the cancer patients across the United States. Cancer registries label these reports every year. The coded labels incorporate pertinent information such as cancer location, behavior, and histology. This information when combined with clinical information, medical imaging and even genomic information have a great potential to fuel discoveries in cancer research. The coding process is manual and requires many human experts to label the large volume of pathology reports in a timely manner. In this study, we have developed a supervised inverse regression based auto-labeler to automate the task. The experiments were conducted on a set of 942 pathology reports with human expert labels as the ground truth. We observed that the inverse regression based auto-labeler consistently performed better than or comparable to the best performing state-of-the-art method. These results demonstrate the potential of inverse regression for reliable information extraction from the pathology reports.","PeriodicalId":281971,"journal":{"name":"2019 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Inverse Regression for Extraction of Tumor Site from Cancer Pathology Reports\",\"authors\":\"A. Dubey, Hong-Jun Yoon, G. Tourassi\",\"doi\":\"10.1109/BHI.2019.8834527\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Pathology reports are the primary source of information for cancer diagnosis of millions of the cancer patients across the United States. Cancer registries label these reports every year. The coded labels incorporate pertinent information such as cancer location, behavior, and histology. This information when combined with clinical information, medical imaging and even genomic information have a great potential to fuel discoveries in cancer research. The coding process is manual and requires many human experts to label the large volume of pathology reports in a timely manner. In this study, we have developed a supervised inverse regression based auto-labeler to automate the task. The experiments were conducted on a set of 942 pathology reports with human expert labels as the ground truth. We observed that the inverse regression based auto-labeler consistently performed better than or comparable to the best performing state-of-the-art method. These results demonstrate the potential of inverse regression for reliable information extraction from the pathology reports.\",\"PeriodicalId\":281971,\"journal\":{\"name\":\"2019 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI)\",\"volume\":\"7 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-05-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BHI.2019.8834527\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BHI.2019.8834527","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Inverse Regression for Extraction of Tumor Site from Cancer Pathology Reports
Pathology reports are the primary source of information for cancer diagnosis of millions of the cancer patients across the United States. Cancer registries label these reports every year. The coded labels incorporate pertinent information such as cancer location, behavior, and histology. This information when combined with clinical information, medical imaging and even genomic information have a great potential to fuel discoveries in cancer research. The coding process is manual and requires many human experts to label the large volume of pathology reports in a timely manner. In this study, we have developed a supervised inverse regression based auto-labeler to automate the task. The experiments were conducted on a set of 942 pathology reports with human expert labels as the ground truth. We observed that the inverse regression based auto-labeler consistently performed better than or comparable to the best performing state-of-the-art method. These results demonstrate the potential of inverse regression for reliable information extraction from the pathology reports.