Partial parsing: a report on work in progress
HLT '91 Proceedings of the workshop on Speech and Natural Language
BBN: description of the PLUM system as used for MUC-4
MUC4 '92 Proceedings of the 4th conference on Message understanding
Hi-index | 0.00 |
Our mid-term to long-term goals in data extraction from text for the next one to three years are to achieve much greater portability to new languages and new domains, greater robustness, and greater scalability. The novel aspect to our approach is the use of learning algorithms and probabilistic models to learn the domain-specific and language-specific knowledge necessary for a new domain and new language. Learning algorithms should contribute to scalability by making it feasible to deal with domains where it would be infeasible to invest sufficient human effort to bring a system up. Probabilistic models can contribute to robustness by allowing for words, constructions, and forms not anticipated ahead of time and by looking for the most likely interpretation in context.