Building a large-scale knowledge base for machine translation
AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
CYC: a large-scale investment in knowledge infrastructure
Communications of the ACM
Deriving concept hierarchies from text
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Snowball: extracting relations from large plain-text collections
DL '00 Proceedings of the fifth ACM conference on Digital libraries
A maximum entropy approach to identifying sentence boundaries
ANLC '97 Proceedings of the fifth conference on Applied natural language processing
A workbench for finding structure in texts
ANLC '97 Proceedings of the fifth conference on Applied natural language processing
Human-centered systems for business services
dg.o '04 Proceedings of the 2004 annual national conference on Digital government research
AusDM '06 Proceedings of the fifth Australasian conference on Data mining and analystics - Volume 61
Ontology generation for large email collections
dg.o '08 Proceedings of the 2008 international conference on Digital government research
Learning the distance metric in a personal ontology
Proceedings of the 2nd international workshop on Ontologies and information systems for the semantic web
Validating Documentation with Domain Ontologies
Proceedings of the 2005 conference on New Trends in Software Methodologies, Tools and Techniques: Proceedings of the fourth SoMeT_W05
Hi-index | 0.00 |
We discuss a domain-independent, corpus based method for dictionary-less automatic extraction of ontological knowledge from domain-specific unannotated documents. We present the architecture, algorithms, and results for ONTOSTRUCT---a new system that uses machine learning and statistical techniques to analyze text sources, discover terms, link equivalent terms into concepts, learn both hierarchical and non-hierarchical conceptual relations, and build an extensive, semantically sound hierarchy of concepts. We report on ONTOSTRUCT's results in constructing a domain-specific ontology for the business registration domain, and evaluate the performance of two of its modules.