An algorithm for pronominal anaphora resolution
Computational Linguistics
A hierarchical approach to wrapper induction
Proceedings of the third annual conference on Autonomous Agents
An Algorithm that Learns What‘s in a Name
Machine Learning - Special issue on natural language learning
Learning Information Extraction Rules for Semi-Structured and Free Text
Machine Learning - Special issue on natural language learning
Machine Learning for Information Extraction in Informal Domains
Machine Learning - Special issue on information retrieval
A New, Fully Automatic Version of Mitkov's Knowledge-Poor Pronoun Resolution Method
CICLing '02 Proceedings of the Third International Conference on Computational Linguistics and Intelligent Text Processing
Learning rules and their exceptions
The Journal of Machine Learning Research
Bottom-up relational learning of pattern matching rules for information extraction
The Journal of Machine Learning Research
Disambiguation of proper names in text
ANLC '97 Proceedings of the fifth conference on Applied natural language processing
Anaphora for everyone: pronominal anaphora resoluation without a parser
COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 1
Automatic acquisition of domain knowledge for Information Extraction
COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 2
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Evaluating parts-of-speech taggers for use in a text-to-scene conversion system
SAICSIT '05 Proceedings of the 2005 annual research conference of the South African institute of computer scientists and information technologists on IT research in developing countries
Adaptive information extraction from text by rule induction and generalisation
IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Constraint-based conversion of fiction text to a time-based graphical representation
Proceedings of the 2007 annual research conference of the South African institute of computer scientists and information technologists on IT research in developing countries
Mechanisms for multimodality: taking fiction to another dimension
AFRIGRAPH '07 Proceedings of the 5th international conference on Computer graphics, virtual reality, visualisation and interaction in Africa
Hi-index | 0.00 |
This paper presents a hierarchical pattern matching and generalisation technique which is applied to the problem of locating the correct speaker of quoted speech found in fiction books. Patterns from a training set are generalised to create a small number of rules, which can be used to identify items of interest within the text. The pattern matching technique is applied to finding the Speech-Verb, Actor and Speaker of quotes found in fiction books. The technique performs well over the training data, resulting in rule-sets many times smaller than the training set, but providing very high accuracy. While the rule-set generalised from one book is less effective when applied to different books than an approach based on hand coded heuristics, performance is comparable when testing on data closely related to the training set.