Broad, Interdisciplinary Science In Tela: An Exposure and Child Health Ontology

Jamie McCusker, S. M. Rashid, Zhicheng Liang, Yue Liu, K. Chastain, Paulo Pinheiro da Silva, J. Stingone, D. McGuinness
{"title":"Broad, Interdisciplinary Science In Tela: An Exposure and Child Health Ontology","authors":"Jamie McCusker, S. M. Rashid, Zhicheng Liang, Yue Liu, K. Chastain, Paulo Pinheiro da Silva, J. Stingone, D. McGuinness","doi":"10.1145/3091478.3091497","DOIUrl":null,"url":null,"abstract":"Data curation for interdisciplinary collaborative science requires a new online web-based approach that integrates domain knowledge from multiple resources and enables in tela (in the web) interactive collaboration between data providers, domain specialists, and data analysts. The Children's Health Exposure Analysis Resource (CHEAR) is a resource for child development and environmental exposure data. The CHEAR Data Center has developed an ontology that integrates study and exposure data in a way that is consistent across the program, and integrates with many best practice relevant vocabularies and repository schemas. This includes the World Wide Web Consortium's recommended Provenance Ontology (PROV), Semanticscience Integrated Ontology (SIO), the Chemical Entities of Biological Interest (CheBI) ontology, the Uberon multi-species anatomy ontology, and the Units Ontology as the starting point for our domain modeling. We mapped terms where they overlapped and extended these ontologies with classes that were required to support modeling and integrating data from epidemiology and chemical exposure measurements that comprise the majority of the data recorded by the CHEAR data center. In response to this challenge, we used an on-demand approach to develop the ontology based on a set of representative pilot projects in CHEAR. After initial development, we evaluated the ontology for completeness in representing an additional pilot study. An epidemiologist was able to produce a mapping of the project to the ontology with only minor corrections needed by an ontology expert. In the large dataset that was tested, one third of the classes needed to represent the dataset needed to be added to the ontology, all of them in areas where we expected to see more ontology expansion. Our overall approach is to drive towards completion of coverage while being relatively easy to use for domain experts. Ultimately we aim to have domain experts handle the majority of extensions and evolution with small interactions with ontology experts. In this paper, we report on our on-demand approach for web-based collaborative interdisciplinary ontology development and maintenance and also introduce the resulting extensible and interoperable exposure and child health ontology.","PeriodicalId":165747,"journal":{"name":"Proceedings of the 2017 ACM on Web Science Conference","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2017 ACM on Web Science Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3091478.3091497","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 12

Abstract

Data curation for interdisciplinary collaborative science requires a new online web-based approach that integrates domain knowledge from multiple resources and enables in tela (in the web) interactive collaboration between data providers, domain specialists, and data analysts. The Children's Health Exposure Analysis Resource (CHEAR) is a resource for child development and environmental exposure data. The CHEAR Data Center has developed an ontology that integrates study and exposure data in a way that is consistent across the program, and integrates with many best practice relevant vocabularies and repository schemas. This includes the World Wide Web Consortium's recommended Provenance Ontology (PROV), Semanticscience Integrated Ontology (SIO), the Chemical Entities of Biological Interest (CheBI) ontology, the Uberon multi-species anatomy ontology, and the Units Ontology as the starting point for our domain modeling. We mapped terms where they overlapped and extended these ontologies with classes that were required to support modeling and integrating data from epidemiology and chemical exposure measurements that comprise the majority of the data recorded by the CHEAR data center. In response to this challenge, we used an on-demand approach to develop the ontology based on a set of representative pilot projects in CHEAR. After initial development, we evaluated the ontology for completeness in representing an additional pilot study. An epidemiologist was able to produce a mapping of the project to the ontology with only minor corrections needed by an ontology expert. In the large dataset that was tested, one third of the classes needed to represent the dataset needed to be added to the ontology, all of them in areas where we expected to see more ontology expansion. Our overall approach is to drive towards completion of coverage while being relatively easy to use for domain experts. Ultimately we aim to have domain experts handle the majority of extensions and evolution with small interactions with ontology experts. In this paper, we report on our on-demand approach for web-based collaborative interdisciplinary ontology development and maintenance and also introduce the resulting extensible and interoperable exposure and child health ontology.
广泛的,跨学科的Tela科学:暴露和儿童健康本体
跨学科协作科学的数据管理需要一种新的基于网络的在线方法,该方法集成了来自多种资源的领域知识,并使数据提供者、领域专家和数据分析师之间能够在tela(在网络上)进行交互协作。儿童健康暴露分析资源(CHEAR)是儿童发展和环境暴露数据的资源。CHEAR数据中心开发了一个本体,该本体以跨程序一致的方式集成了研究和公开数据,并集成了许多与最佳实践相关的词汇表和存储库模式。这包括万维网联盟推荐的来源本体(PROV),语义科学集成本体(SIO),生物兴趣化学实体(CheBI)本体,Uberon多物种解剖学本体,以及作为我们领域建模起点的单元本体。我们将术语映射到它们重叠的地方,并将这些本体扩展为支持建模和集成流行病学和化学暴露测量数据所需的类,这些数据包括CHEAR数据中心记录的大部分数据。为了应对这一挑战,我们使用了一种按需方法来开发基于CHEAR中一组代表性试点项目的本体。在最初的开发之后,我们评估了本体的完整性,以代表一个额外的试点研究。流行病学家能够生成项目到本体的映射,而本体专家只需要进行微小的修正。在测试的大型数据集中,三分之一的类需要表示数据集,需要添加到本体中,所有这些都在我们期望看到更多本体扩展的领域。我们的总体方法是推动覆盖的完成,同时对领域专家来说相对容易使用。最终,我们的目标是让领域专家通过与本体专家的少量交互来处理大部分的扩展和进化。在本文中,我们报告了基于web的协作跨学科本体开发和维护的按需方法,并介绍了由此产生的可扩展和可互操作的暴露和儿童健康本体。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信