Automatic part of speech tagging is an area of natural language processing where statistical techniques have been more successful than rule-based methods. In this paper, we present a simple rule-based part of speech tagger that automatically acquires its rules and tags with accuracy comparable to that of stochastic taggers. The rule-based tagger has many advantages over stochastic taggers, including a vast reduction in the stored information required, the perspicuity of a small set of meaningful rules, ease of finding and implementing improvements to the tagger, and better portability from one tag set, corpus genre, or language to another. Perhaps the biggest contribution of this work is in demonstrating that the stochastic method is not the only viable method for part of speech tagging. The fact that a simple rule-based tagger that automatically learns its rules can perform so well should encourage researchers to explore rule-based tagging further, searching for a better and more expressive set of rule templates and other variations on the simple but effective theme described below.
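To make the idea of a tagger that "automatically acquires its rules" concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the paper's implementation: the abstract does not spell out the learning procedure, so the sketch assumes a most-frequent-tag initial guess, a single hypothetical rule template ("change tag A to B when the previous tag is Z"), and a greedy search that keeps whichever rule removes the most errors. The function names, the `NN` default for unknown words, and the toy corpus are all invented for the example.

```python
from collections import Counter, defaultdict


def most_frequent_tags(corpus):
    """Map each word to the tag it carries most often in the corpus."""
    counts = defaultdict(Counter)
    for sentence in corpus:
        for word, tag in sentence:
            counts[word][tag] += 1
    return {word: c.most_common(1)[0][0] for word, c in counts.items()}


def apply_rule(tags, rule):
    """Apply one rule (from_tag, to_tag, prev_tag) left to right."""
    from_tag, to_tag, prev_tag = rule
    out = list(tags)
    for i in range(1, len(out)):
        if out[i] == from_tag and out[i - 1] == prev_tag:
            out[i] = to_tag
    return out


def count_errors(tagged, gold):
    """Count positions where the current tags disagree with the gold tags."""
    return sum(t != g for ts, gs in zip(tagged, gold) for t, g in zip(ts, gs))


def learn_rules(corpus, default_tag="NN", max_rules=10):
    """Greedily pick rules that reduce errors on the annotated corpus."""
    lexicon = most_frequent_tags(corpus)
    gold = [[tag for _, tag in s] for s in corpus]
    current = [[lexicon.get(w, default_tag) for w, _ in s] for s in corpus]
    rules = []
    for _ in range(max_rules):
        # Instantiate the single template at every current error position.
        candidates = {
            (cur[i], g[i], cur[i - 1])
            for cur, g in zip(current, gold)
            for i in range(1, len(cur))
            if cur[i] != g[i]
        }
        best_rule, best_errors = None, count_errors(current, gold)
        for rule in sorted(candidates):
            errors = count_errors([apply_rule(t, rule) for t in current], gold)
            if errors < best_errors:
                best_rule, best_errors = rule, errors
        if best_rule is None:  # no candidate improves accuracy; stop
            break
        rules.append(best_rule)
        current = [apply_rule(t, best_rule) for t in current]
    return lexicon, rules


def tag(words, lexicon, rules, default_tag="NN"):
    """Tag words with the initial guess, then apply learned rules in order."""
    tags = [lexicon.get(w, default_tag) for w in words]
    for rule in rules:
        tags = apply_rule(tags, rule)
    return list(zip(words, tags))


# Toy annotated corpus (invented for illustration).
corpus = [
    [("the", "DT"), ("can", "NN"), ("rusted", "VBD")],
    [("the", "DT"), ("can", "NN"), ("fell", "VBD")],
    [("we", "PRP"), ("can", "MD"), ("run", "VB")],
    [("they", "PRP"), ("can", "MD"), ("go", "VB")],
    [("you", "PRP"), ("can", "MD"), ("see", "VB")],
]
lexicon, rules = learn_rules(corpus)
print(rules)
print(tag(["the", "can", "fell"], lexicon, rules))
```

On this toy data the initial guess tags every "can" as a modal, and the learner recovers one readable rule that retags it as a noun after a determiner. The whole model is a word-to-tag lexicon plus a short rule list, which illustrates the abstract's points about small storage and perspicuity. A fuller version would probably learn rules on text held out from the lexicon and use more than one template.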