A Knowledge Graph Approach for the Secondary Use of Cancer Registry Data

S. Hasan, D. Rivera, Xiao-Cheng Wu, J. B. Christian, G. Tourassi
Published in: 2019 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI), 2019-05-01
DOI: 10.1109/BHI.2019.8834538 (https://doi.org/10.1109/BHI.2019.8834538)
Citations: 2

Abstract

Population-based central cancer registries collect valuable structured and unstructured cancer data, primarily for surveillance and reporting. The collected data include (1) categorization of each cancer case (tumor) at the time of diagnosis, (2) demographic information about the patient, such as age, gender, and location at the time of diagnosis, (3) first course of treatment information, and (4) survival outcomes when available. While advanced analytical approaches such as SEER*Stat and SAS exist, we provide a knowledge graph approach to organizing cancer registry data for advanced analytics, which offers unique advantages over existing approaches. This knowledge graph approach semantically enriches the data and enables straightforward linking with third-party data to help understand variation in cancer outcomes. A knowledge graph was developed using Louisiana Tumor Registry data. We present the advantages of the knowledge graph approach by examining (i) scenario-specific queries and (ii) linkages with publicly available external datasets. Our results demonstrate that this graph-based solution can perform complex queries, improve query run-time performance by 81%, and more easily support iterative analyses that enhance researchers' understanding of cancer registry data.
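To make the idea of graph-organized registry data concrete, the following is a minimal sketch only, not the authors' implementation. It assumes a plain in-memory list of (subject, predicate, object) triples; the abstract does not say which graph store or schema the paper used. Every identifier here (case:1, hasSite, diagnosedIn, ruralUrban, the helper functions) and every value is a hypothetical placeholder rather than registry data. It shows a scenario-style query that combines case attributes with a county-level attribute taken from an imagined external dataset, linked through a shared county node.

```python
# Toy triples (subject, predicate, object). All identifiers and values are
# invented placeholders for illustration, not registry data.
TRIPLES = [
    ("case:1", "hasSite", "site:breast"),
    ("case:1", "diagnosedIn", "county:A"),
    ("case:1", "firstCourse", "treatment:surgery"),
    ("case:2", "hasSite", "site:lung"),
    ("case:2", "diagnosedIn", "county:B"),
    ("case:2", "firstCourse", "treatment:radiation"),
    ("case:3", "hasSite", "site:breast"),
    ("case:3", "diagnosedIn", "county:B"),
    ("case:3", "firstCourse", "treatment:surgery"),
    # Hypothetical external dataset, linked on the shared county node.
    ("county:A", "ruralUrban", "urban"),
    ("county:B", "ruralUrban", "rural"),
]


def match(triples, s=None, p=None, o=None):
    """Return the triples that fit the pattern; None acts as a wildcard."""
    return [
        t for t in triples
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    ]


def cases_by_site_and_county_attr(triples, site, attr, value):
    """Cases with the given site, diagnosed in counties where attr == value."""
    counties = {s for s, _, _ in match(triples, p=attr, o=value)}
    with_site = {s for s, _, _ in match(triples, p="hasSite", o=site)}
    return sorted(
        s for s, _, o in match(triples, p="diagnosedIn")
        if s in with_site and o in counties
    )


if __name__ == "__main__":
    # Breast cancer cases diagnosed in counties flagged as rural.
    print(cases_by_site_and_county_attr(
        TRIPLES, "site:breast", "ruralUrban", "rural"))
```

Because the external attribute hangs off the same county node that cases already point to, adding another third-party dataset in this sketch only means appending more triples; the query helper does not change. A production system would more likely use a dedicated graph database or an RDF store with a query language instead of list scans.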