We identify two complementary processes in the conversion of machine-readable dictionaries into lexical databases: recovery of the dictionary structure from the typographical markings that persist on the dictionary distribution tapes and embody the publishers' notational conventions, followed by making explicit all of the codified and elided information packed into individual entries. We discuss notational conventions and tape formats, outline structural properties of dictionaries, observe a range of representational phenomena particularly relevant to dictionary parsing, and derive a set of minimal requirements for a dictionary grammar formalism. We present a general-purpose dictionary entry parser that uses a formal notation designed to describe the structure of entries and maps the flat character stream on the tape to a highly structured and fully instantiated representation of the dictionary. We demonstrate the power of the formalism by drawing examples from a range of dictionary sources that have been processed and converted into lexical databases.
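The two processes can be illustrated with a minimal Python sketch. It is not the grammar formalism the paper presents, only a hand-written toy under stated assumptions: the font codes `*B`, `*I`, and `*R` (bold, italic, roman), the `~` marker for an elided headword in a run-on form, the abbreviation table, and the sample entry are all hypothetical and do not come from any real distribution tape.

```python
import re

# Hypothetical font-change codes (*B bold, *I italic, *R roman).
# Real distribution tapes use publisher-specific markings.
TOKEN = re.compile(r"\*([BIR])([^*]*)")

# Assumed abbreviation convention, for illustration only.
POS_ABBREVIATIONS = {"n": "noun", "v": "verb", "adj": "adjective"}


def tokenize(raw):
    """Split the flat character stream into (font, text) segments."""
    return [(font, text.strip()) for font, text in TOKEN.findall(raw)]


def parse_entry(raw):
    """Process 1: recover entry structure from the typographical markings."""
    entry = {"headword": None, "pos": None, "definition": None, "run_ons": []}
    for font, text in tokenize(raw):
        if font == "B" and entry["headword"] is None:
            entry["headword"] = text          # first bold segment is the headword
        elif font == "B":
            entry["run_ons"].append(text)     # later bold segments are run-on forms
        elif font == "I" and entry["pos"] is None:
            entry["pos"] = text               # first italic segment is the part of speech
        elif font == "R" and text and entry["definition"] is None:
            entry["definition"] = text        # first roman segment is the definition
    return entry


def expand(entry):
    """Process 2: make abbreviated and elided information explicit."""
    full = dict(entry)
    full["pos"] = POS_ABBREVIATIONS.get(entry["pos"], entry["pos"])
    full["run_ons"] = [
        entry["headword"] + form[1:] if form.startswith("~") else form
        for form in entry["run_ons"]
    ]
    return full


raw = "*Bquick*Iadj*Rmoving fast*B~ness"   # made-up sample entry
entry = expand(parse_entry(raw))
print(entry)
```

In this toy, the structural step assigns roles purely by font and position, and the expansion step fills in what the notation leaves implicit: the abbreviated part of speech and the headword elided from the run-on form. A declarative entry grammar, as argued for in the abstract, would replace the hard-coded branching in `parse_entry`.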