Analysis on the Graph Techniques for Data-mining and Visualization of Heterogeneous Biodiversity Data Sets

Authors: Anna Cohen-Nabeiro, Miquel A. Senar, Jean-Pierre Féral, Denis Couvet, Aurélie Delavaud, Vicente José Ivars Camáñez, Víctor Méndez Muñoz, Alfons Nonell-Canals, Thierry Tatoni, Romain David
Contributors: Department of Computer Architecture & Operating Systems (CAOS), Fondation pour la recherche sur la Biodiversité (FRB), Institut méditerranéen de biodiversité et d'écologie marine et continentale (IMBE), Avignon Université (AU)-Aix Marseille Université (AMU)-Institut de recherche pour le développement [IRD] : UMR237-Centre National de la Recherche Scientifique (CNRS), Mind the Byte, Centre d'Ecologie et des Sciences de la COnservation (CESCO), Muséum national d'Histoire naturelle (MNHN)-Université Pierre et Marie Curie - Paris 6 (UPMC)-Centre National de la Recherche Scientifique (CNRS)
Language: English
Year of publication: 2017
Subject:
Source: Complexis 2017
Complexis 2017, Apr 2017, Porto, Portugal. pp.144-151, ⟨10.5220/0006379701440151⟩
Scopus-Elsevier
COMPLEXIS
DOI: 10.5220/0006379701440151
Description: Existing biodiversity databases contain an abundance of information. To turn such information into knowledge, it is necessary to address several information-model issues. Biodiversity data are collected for various scientific objectives, often without clear preliminary objectives; they may follow different taxonomy standards and organization logic, and may be held in multiple file formats using a variety of database technologies. This paper presents a graph catalogue model for the metadata management of biodiversity databases. It explores possible data-mining and visualization operations to guide the analysis of heterogeneous biodiversity data. In particular, we propose contributions to the problems of (1) the analysis of heterogeneous distributed data found across different databases, (2) the identification of matches and approximations between data sets, and (3) the identification of relationships between various databases. This paper describes a proof of concept of an infrastructure testbed and its basic operations, and presents an evaluation of the resulting system against the ideal expectations of the ecologist.
Database: OpenAIRE