Simplifying data access: the energy data collection (EDC) project

Authors:
José Luis Ambite;Yigal Arens;Luis Gravano;Vasileios Hatzivassiloglou;Eduard Hovy;Judith Klavans;Andrew Philpot;Usha Ramachandran;Jay Sandhaus;Anurag Singla;Brian Whitman
Affiliations:
University of Southern California, Marina del Rey, CA;University of Southern California, Marina del Rey, CA;Columbia University, New York, NY;Columbia University, New York, NY;University of Southern California, Marina del Rey, CA;Columbia University, New York, NY;University of Southern California, Marina del Rey, CA;University of Southern California, Marina del Rey, CA;Columbia University, New York, NY;Columbia University, New York, NY;Columbia University, New York, NY
Venue:
dg.o '00 Proceedings of the 2000 annual national conference on Digital government research
Year:
2000

Citing 5
Cited 0

Building a large-scale knowledge base for machine translation

AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
Implementing data cubes efficiently

SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
Flexible and scalable cost-based query planning in mediators: a transformational approach

Artificial Intelligence - Special issue on Intelligent internet systems
COMPLEX: a computational lexicon for natural language systems

COLING '88 Proceedings of the 12th conference on Computational linguistics - Volume 2
TGE: Tlinks Generation Environment

COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 1

Quantified Score

Hi-index	0.00

Visualization

Abstract

The massive amount of statistical and text data available from government agencies has created a set of daunting challenges to both research and analysis communities. These problems include heterogeneity, size, distribution, and control of terminology. At the Digital Government Research Center we are investigating solutions to these key problems. In this paper we focus on (1) ontological mappings for terminology standardization, (2) data integration across data bases with high speed query processing, and (3) interfaces for query input and presentation of results. This collaboration between researchers from Columbia University and the Information Sciences Institute of the University of Southern California employs technology developed at both locations, in particular the SENSUS ontology, the SIMS multi-database access planner, the LKB automated dictionary and terminology analysis system, and others. The pilot application targets gasoline data from the Bureau of Labor Statistics, the Energy Information Administration of the Department of Energy, the Census Bureau, and other government agencies.