Bidirectional inference with the easiest-first strategy for tagging sequence data

Authors:
Yoshimasa Tsuruoka;Jun'ichi Tsujii
Affiliations:
CREST, JST (Japan Science and Technology Corporation), Saitama, Japan and University of Tokyo, Tokyo, Japan;University of Tokyo, Tokyo, Japan and University of Manchester, MANCHESTER, UK and CREST, JST (Japan Science and Technology Corporation), Saitama, Japan
Venue:
HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Year:
2005

Citing 13
Cited 65

A maximum entropy approach to natural language processing

Computational Linguistics
Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data

ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
Text chunking based on a generalization of winnow

The Journal of Machine Learning Research
Building a large annotated corpus of English: the penn treebank

Computational Linguistics - Special issue on using large corpora: II
TnT: a statistical part-of-speech tagger

ANLC '00 Proceedings of the sixth conference on Applied natural language processing
Representing text chunks

EACL '99 Proceedings of the ninth conference on European chapter of the Association for Computational Linguistics
Chunking with support vector machines

NAACL '01 Proceedings of the second meeting of the North American Chapter of the Association for Computational Linguistics on Language technologies
Feature-rich part-of-speech tagging with a cyclic dependency network

NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
A SNoW based supertagger with application to NP chunking

ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Introduction to the CoNLL-2000 shared task: chunking

ConLL '00 Proceedings of the 2nd workshop on Learning language in logic and the 4th conference on Computational natural language learning - Volume 7
Use of support vector learning for chunk identification

ConLL '00 Proceedings of the 2nd workshop on Learning language in logic and the 4th conference on Computational natural language learning - Volume 7
Discriminative training methods for hidden Markov models: theory and experiments with perceptron algorithms

EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
Evaluation and extension of maximum entropy models with inequality constraints

EMNLP '03 Proceedings of the 2003 conference on Empirical methods in natural language processing

Semantic retrieval for the accurate identification of relational concepts in massive textbases

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Integrating data and text mining processes for digital library applications

Proceedings of the 7th ACM/IEEE-CS joint conference on Digital libraries
Information distance from a question to an answer

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Feature forest models for probabilistic hpsg parsing

Computational Linguistics
DODDLE-OWL: Interactive Domain Ontology Development with Open Source Software in Java

IEICE - Transactions on Information and Systems
New information distance measure and its application in question answering system

Journal of Computer Science and Technology
Search-based structured prediction

Machine Learning
Minimally lexicalized dependency parsing

ACL '07 Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions
Learning the scope of hedge cues in biomedical texts

BioNLP '09 Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing
A supervised learning approach to biological question answering

Integrated Computer-Aided Engineering - Selected papers from the IEEE Conference on Information Reuse and Integration (IRI), July 13-15, 2008
A fast boosting-based learner for feature-rich tagging and chunking

CoNLL '08 Proceedings of the Twelfth Conference on Computational Natural Language Learning
A metalearning approach to processing the scope of negation

CoNLL '09 Proceedings of the Thirteenth Conference on Computational Natural Language Learning
Classifying what-type questions by head noun tagging

COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Shift-reduce dependency DAG parsing

COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Extremely lexicalized models for accurate and fast HPSG parsing

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Learning the scope of negation in biomedical texts

EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Extending lexical association measures for collocation extraction

Computer Speech and Language
A log-linear model with an n-gram reference distribution for accurate HPSG parsing

IWPT '07 Proceedings of the 10th International Conference on Parsing Technologies
UPAR7: a knowledge-based system for headline sentiment tagging

SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations
Ambiguous part-of-speech tagging for improving accuracy and domain portability of syntactic parsers

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Chunk parsing revisited

Parsing '05 Proceedings of the Ninth International Workshop on Parsing Technology
Dependency grammar induction via bitext projection constraints

ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - Volume 1
Text classification using graph mining-based feature extraction

Knowledge-Based Systems
Biomedical question answering: A survey

Computer Methods and Programs in Biomedicine
A topological embedding of the lexicon for semantic distance computation

Natural Language Engineering
Efficient staggered decoding for sequence labeling

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Using SVMs with the command relation features to identify negated events in biomedical literature

NeSp-NLP '10 Proceedings of the Workshop on Negation and Speculation in Natural Language Processing
SemEval-2010 task 14: Word sense induction & disambiguation

SemEval '10 Proceedings of the 5th International Workshop on Semantic Evaluation
Posterior Regularization for Structured Latent Variable Models

The Journal of Machine Learning Research
Variable-length Markov models and ambiguous words in Portuguese

YIWCALA '10 Proceedings of the NAACL HLT 2010 Young Investigators Workshop on Computational Approaches to Languages of the Americas
Variable-length Markov models and ambiguous words in Portuguese

YIWCALA '10 Proceedings of the NAACL HLT 2010 Young Investigators Workshop on Computational Approaches to Languages of the Americas
Improving gender classification of blog authors

EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Latent-descriptor clustering for unsupervised POS induction

EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Automatic classification of sentences for evidence based medicine

DTMBIO '10 Proceedings of the ACM fourth international workshop on Data and text mining in biomedical informatics
Comparing and combining chunkers of biomedical text

Journal of Biomedical Informatics
Part-of-speech tagging from 97% to 100%: is it time for some linguistics?

CICLing'11 Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part I
Ripple down rules for part-of-speech tagging

CICLing'11 Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part I
Modeling reciprocity in social interactions with probabilistic latent space models

Natural Language Engineering
Review spotlight: a user interface for summarizing user-generated reviews using adjective-noun word pairs

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Ontology learning from biomedical natural language documents using UMLS

Expert Systems with Applications: An International Journal
Unsupervised relation extraction using dependency trees for automatic generation of multiple-choice questions

Canadian AI'11 Proceedings of the 24th Canadian conference on Advances in artificial intelligence
Learning with lookahead: can history-based models rival globally optimized models?

CoNLL '11 Proceedings of the Fifteenth Conference on Computational Natural Language Learning
Improving Korean verb-verb morphological disambiguation using lexical knowledge from unambiguous unlabeled data and selective web counts

Pattern Recognition Letters
Methods and algorithms for automatic text analysis

Automatic Documentation and Mathematical Linguistics
The taming of reconcile as a biomedical coreference resolver

BioNLP Shared Task '11 Proceedings of the BioNLP Shared Task 2011 Workshop
A framework for schema-driven relationship discovery from unstructured text

ISWC'06 Proceedings of the 5th international conference on The Semantic Web
Classification and pattern discovery of mood in weblogs

PAKDD'10 Proceedings of the 14th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part II
DODDLE-OWL: a domain ontology construction tool with OWL

ASWC'06 Proceedings of the First Asian conference on The Semantic Web
A framework and its empirical study of automatic diagnosis of traditional Chinese medicine utilizing raw free-text clinical records

Journal of Biomedical Informatics
On-the-Fly generation of facets as navigation signs for web objects

DASFAA'12 Proceedings of the 17th international conference on Database Systems for Advanced Applications - Volume Part I
Analysis of adjective-noun word pair extraction methods for online review summarization

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Language supports for journal abstract writing across disciplines

Journal of Computer Assisted Learning
Lexical surprisal as a general predictor of reading time

EACL '12 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
Feature-rich part-of-speech tagging for morphologically complex languages: application to Bulgarian

EACL '12 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
*SEM 2012 shared task: resolving the scope and focus of negation

SemEval '12 Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation
Iterative viterbi A* algorithm for k-best sequential decoding

ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
A cost sensitive part-of-speech tagging: differentiating serious errors from minor errors

ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
Building a lightweight semantic model for unsupervised information extraction on short listings

EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Enhancing search: events and their discourse context

CICLing'13 Proceedings of the 14th international conference on Computational Linguistics and Intelligent Text Processing - Volume 2
Methodological Review: Biomedical text mining and its applications in cancer research

Journal of Biomedical Informatics
Multiple-choice cloze exercise generation through English grammar learning support

International Journal of Knowledge and Web Intelligence
Named entity recognition with multiple segment representations

Information Processing and Management: an International Journal
"Mining events from the literature for bioinformatics applications" by S. Ananiadou, P. Thompson, and R. Nawaz; with Martin Vesely as coordinator

ACM SIGWEB Newsletter
Supervised hypothesis discovery using syllogistic patterns in the biomedical literature

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Evaluating Word Sense Induction and Disambiguation Methods

Language Resources and Evaluation

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper presents a bidirectional inference algorithm for sequence labeling problems such as part-of-speech tagging, named entity recognition and text chunking. The algorithm can enumerate all possible decomposition structures and find the highest probability sequence together with the corresponding decomposition structure in polynomial time. We also present an efficient decoding algorithm based on the easiest-first strategy, which gives comparably good performance to full bidirectional inference with significantly lower computational cost. Experimental results of part-of-speech tagging and text chunking show that the proposed bidirectional inference methods consistently outperform unidirectional inference methods and bidirectional MEMMs give comparable performance to that achieved by state-of-the-art learning algorithms including kernel support vector machines.