Analysis on the Graph Techniques for Data-mining and Visualization of Heterogeneous Biodiversity Data Sets

International Conference on Complex Information Systems · Pub Date: 2017-04-24 · DOI: 10.5220/0006379701440151

V. Muñoz, Anna Cohen-Nabeiro, R. David, Vicente José Ivars Camáñez, A. Nonell-Canals, M. A. Senar, D. Couvet, J. Féral, A. Delavaud, T. Tatoni


Citations: 3

Abstract


Existing biodiversity databases contain an abundance of information. To turn such information into knowledge, it is necessary to address several information-model issues. Biodiversity data are collected for various scientific objectives, often without clear preliminary objectives; they may follow different taxonomy standards and organization logic, and may be held in multiple file formats using a variety of database technologies. This paper presents a graph catalogue model for the metadata management of biodiversity databases. It explores how data mining and visualization operations can guide the analysis of heterogeneous biodiversity data. In particular, we propose contributions to the problems of (1) the analysis of heterogeneous distributed data found across different databases, (2) the identification of matches and approximations between data sets, and (3) the identification of relationships between various databases. This paper describes a proof of concept of an infrastructure testbed and its basic operations, and evaluates the resulting system against the ideal expectations of the ecologist.
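The abstract does not specify how the graph catalogue is implemented, so the following is only an illustrative sketch of the general idea, not the authors' model. It assumes a bipartite graph in which databases and metadata terms are nodes, and it uses Jaccard similarity over shared terms as a stand-in for "matches and approximations between data sets". The database names, metadata terms, helper functions, and the 0.5 threshold are all hypothetical.

```python
from itertools import combinations

# Hypothetical catalogue: database name -> set of metadata terms.
# None of these names or terms come from the paper.
catalogue = {
    "db_marine": {"taxon", "latitude", "longitude", "depth", "date"},
    "db_birds": {"taxon", "latitude", "longitude", "date", "observer"},
    "db_soil": {"ph", "depth", "site"},
}


def build_graph(catalogue):
    """Build a bipartite adjacency map linking database nodes and term nodes."""
    graph = {}
    for db, terms in catalogue.items():
        graph.setdefault(("db", db), set())
        for term in terms:
            graph[("db", db)].add(("term", term))
            graph.setdefault(("term", term), set()).add(("db", db))
    return graph


def jaccard(a, b):
    """Jaccard similarity of two sets (assumed here as the matching measure)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def related_databases(graph, threshold=0.5):
    """List database pairs whose shared metadata terms meet the threshold."""
    dbs = sorted(node for node in graph if node[0] == "db")
    pairs = []
    for a, b in combinations(dbs, 2):
        score = jaccard(graph[a], graph[b])
        if score >= threshold:
            pairs.append((a[1], b[1], round(score, 2)))
    return pairs


graph = build_graph(catalogue)
links = related_databases(graph)
print(links)
```

In this toy catalogue only the marine and bird databases share enough terms to be linked. A real catalogue would presumably also need to reconcile differing taxonomy standards before comparing terms, which this sketch does not attempt.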

