Scientific names are ambiguous as identifiers for biological taxa: their context and definition are required for accurate data integration

  • Authors:
  • Jessie B. Kennedy;Robert Kukla;Trevor Paterson

  • Affiliations:
  • School of Computing, Napier University, Edinburgh, UK;School of Computing, Napier University, Edinburgh, UK;School of Computing, Napier University, Edinburgh, UK

  • Venue:
  • DILS'05 Proceedings of the Second international conference on Data Integration in the Life Sciences
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

Biologists use scientific names to label the organisms described in their data; however, these names are not unique identifiers for taxonomic entities. Alternative taxonomic classifications may apply the same name, associated with alternative definition or circumscription. Consequently, labelling data with scientific names alone does not unambiguously distinguish between taxon concepts. Accurate integration and comparison of biological data is required on taxon concepts, as defined in alternative taxonomic classifications. We have derived an abstract, inclusive model for the diverse representations of taxonomic concepts used by taxonomists and in taxonomic databases. This model has been implemented as a proposed standard XML schema for the exchange and comparison of taxonomic concepts between data providers and users. The representation and exchange of taxon definitions conformant with this schema will facilitate the development of taxonomic name/concept resolution services, allowing the meaningful integration and comparison of biological datasets, with greater accuracy than on the basis of name alone.