An automated multi-component approach to extracting entity relationships from database requirement specification documents

  • Authors: Siqing Du; Douglas P. Metzler
  • Affiliation (both authors): School of Information Sciences, University of Pittsburgh, Pittsburgh, PA
  • Venue: NLDB'06: Proceedings of the 11th International Conference on Applications of Natural Language to Information Systems
  • Year: 2006

Abstract

This paper describes a natural language system that extracts entity-relationship diagram components from natural language database design documents. The system fully integrates existing, publicly available components (a parser, WordNet, and Google web corpus search facilities) with a novel rule-based tuple-extraction process. It differs from previous approaches in being fully automatic (as opposed to requiring human disambiguation or other interaction) and in achieving higher performance than previously reported results.
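
To make the idea of rule-based tuple extraction concrete, the following is a minimal sketch and not the authors' implementation. It assumes the sentence has already been tokenized and tagged by some parser, and it applies a single hypothetical rule (noun group, verb, noun group) to produce an (entity, relationship, entity) tuple. The names `chunk` and `extract_tuples`, the coarse tag set, and the rule itself are illustrative assumptions; the WordNet and web-search components mentioned in the abstract are omitted.

```python
from typing import List, Tuple

# Assumption: input is a list of (token, coarse POS tag) pairs from some parser.
Tagged = List[Tuple[str, str]]


def chunk(tagged: Tagged) -> List[List[str]]:
    """Keep only nouns and verbs, merging adjacent nouns into one noun group."""
    chunks: List[List[str]] = []  # each item is [kind, text]
    prev_tag = None
    for token, tag in tagged:
        if tag == "NOUN" and prev_tag == "NOUN":
            chunks[-1][1] += " " + token  # extend a compound noun
        elif tag in ("NOUN", "VERB"):
            chunks.append([tag, token])
        prev_tag = tag
    return chunks


def extract_tuples(tagged: Tagged) -> List[Tuple[str, str, str]]:
    """Hypothetical rule: NOUN-group VERB NOUN-group -> (entity, relationship, entity)."""
    chunks = chunk(tagged)
    result = []
    for (k1, left), (k2, verb), (k3, right) in zip(chunks, chunks[1:], chunks[2:]):
        if (k1, k2, k3) == ("NOUN", "VERB", "NOUN"):
            result.append((left, verb, right))
    return result


# Hand-tagged example sentence: "Each department employs many employees"
sentence = [
    ("Each", "DET"),
    ("department", "NOUN"),
    ("employs", "VERB"),
    ("many", "ADJ"),
    ("employees", "NOUN"),
]
print(extract_tuples(sentence))  # [('department', 'employs', 'employees')]
```

A single pattern like this only handles simple declarative sentences; a realistic system would presumably need many rules plus lexical resources to decide which nouns are entities and which are attributes.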