异构生物多样性数据集数据挖掘与可视化的图技术分析

V. Muñoz, Anna Cohen-Nabeiro, R. David, Vicente José Ivars Camáñez, A. Nonell-Canals, M. A. Senar, D. Couvet, J. Féral, A. Delavaud, T. Tatoni
{"title":"异构生物多样性数据集数据挖掘与可视化的图技术分析","authors":"V. Muñoz, Anna Cohen-Nabeiro, R. David, Vicente José Ivars Camáñez, A. Nonell-Canals, M. A. Senar, D. Couvet, J. Féral, A. Delavaud, T. Tatoni","doi":"10.5220/0006379701440151","DOIUrl":null,"url":null,"abstract":"Extisting biodiversity databases contain an abundance of information. To turn such information into knowledge , it is necessary to address several information-model issues. Biodiversity data are collected for various scientific objectives, often even without clear preliminary objectives, may follow different taxonomy standards and organization logic, and be held in multiple file formats and utilising a variety of database technologies. This paper presents a graph catalogue model for the metadata management of biodiversity databases. It explores the possible operation of data mining and visualization to guide the analysis of heterogeneous biodiversity data. In particular, we would propose contributions to the problems of (1) the analysis of heterogeneous distributed data found across different databases, (2) the identification of matches and approximations between data sets, and (3) the identificaton of relationships between various databases. This paper describes a proof of concept of an infrastructure testbed and its basic operations, presenting an evaluation of the resulting system in comparison with the ideal expectations of the ecologist.","PeriodicalId":414016,"journal":{"name":"International Conference on Complex Information Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Analysis on the Graph Techniques for Data-mining and Visualization of Heterogeneous Biodiversity Data Sets\",\"authors\":\"V. Muñoz, Anna Cohen-Nabeiro, R. David, Vicente José Ivars Camáñez, A. Nonell-Canals, M. A. Senar, D. Couvet, J. Féral, A. Delavaud, T. Tatoni\",\"doi\":\"10.5220/0006379701440151\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Extisting biodiversity databases contain an abundance of information. To turn such information into knowledge , it is necessary to address several information-model issues. Biodiversity data are collected for various scientific objectives, often even without clear preliminary objectives, may follow different taxonomy standards and organization logic, and be held in multiple file formats and utilising a variety of database technologies. This paper presents a graph catalogue model for the metadata management of biodiversity databases. It explores the possible operation of data mining and visualization to guide the analysis of heterogeneous biodiversity data. In particular, we would propose contributions to the problems of (1) the analysis of heterogeneous distributed data found across different databases, (2) the identification of matches and approximations between data sets, and (3) the identificaton of relationships between various databases. This paper describes a proof of concept of an infrastructure testbed and its basic operations, presenting an evaluation of the resulting system in comparison with the ideal expectations of the ecologist.\",\"PeriodicalId\":414016,\"journal\":{\"name\":\"International Conference on Complex Information Systems\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-04-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Conference on Complex Information Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5220/0006379701440151\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Complex Information Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5220/0006379701440151","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

现有的生物多样性数据库包含了丰富的信息。为了将这些信息转化为知识,有必要解决几个信息模型问题。生物多样性数据是为各种科学目标收集的,通常甚至没有明确的初步目标,可能遵循不同的分类标准和组织逻辑,并以多种文件格式保存并利用各种数据库技术。提出了一种用于生物多样性数据库元数据管理的图目录模型。探讨了数据挖掘和可视化的可能操作,以指导异质性生物多样性数据的分析。特别是,我们将提出对以下问题的贡献:(1)跨不同数据库发现的异构分布式数据的分析,(2)数据集之间匹配和近似的识别,以及(3)各种数据库之间关系的识别。本文描述了基础设施试验台及其基本操作的概念证明,并将结果系统与生态学家的理想期望进行了比较。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Analysis on the Graph Techniques for Data-mining and Visualization of Heterogeneous Biodiversity Data Sets
Extisting biodiversity databases contain an abundance of information. To turn such information into knowledge , it is necessary to address several information-model issues. Biodiversity data are collected for various scientific objectives, often even without clear preliminary objectives, may follow different taxonomy standards and organization logic, and be held in multiple file formats and utilising a variety of database technologies. This paper presents a graph catalogue model for the metadata management of biodiversity databases. It explores the possible operation of data mining and visualization to guide the analysis of heterogeneous biodiversity data. In particular, we would propose contributions to the problems of (1) the analysis of heterogeneous distributed data found across different databases, (2) the identification of matches and approximations between data sets, and (3) the identificaton of relationships between various databases. This paper describes a proof of concept of an infrastructure testbed and its basic operations, presenting an evaluation of the resulting system in comparison with the ideal expectations of the ecologist.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信