Generic XML-based framework for metadata portals

  • Authors:
  • Uwe Schindler;Michael Diepenbroek

  • Affiliations:
  • Center for Marine Environmental Sciences (MARUM), University of Bremen, Leobener Straíe, D-28359 Bremen, Germany.;Center for Marine Environmental Sciences (MARUM), University of Bremen, Leobener Straíe, D-28359 Bremen, Germany.

  • Venue:
  • Computers & Geosciences
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a generic and flexible framework for building geoscientific metadata portals independent of content standards for metadata and protocols. Data can be harvested with commonly used protocols (e.g., Open Archives Initiative Protocol for Metadata Harvesting) and metadata standards like DIF or ISO 19115. The new Java-based portal software supports any XML encoding and makes metadata searchable through Apache Lucene. Software administrators are free to define searchable fields independent of their type using XPath. In addition, by extending the full-text search engine (FTS) Apache Lucene, we have significantly improved queries for numerical and date/time ranges by supplying a new trie-based algorithm, thus, enabling high-performance space/time retrievals in FTS-based geo portals. The harvested metadata are stored in separate indexes, which makes it possible to combine these into different portals. The portal-specific Java API and web service interface is highly flexible and supports custom front-ends for users, provides automatic query completion (AJAX), and dynamic visualization with conventional mapping tools. The software has been made freely available through the open source concept.