TnT: a statistical part-of-speech tagger

Authors:
Thorsten Brants
Affiliations:
Saarland University, Saarbrücken, Germany
Venue:
ANLC '00 Proceedings of the sixth conference on Applied natural language processing
Year:
2000

Citing 5
Cited 306

A corpus-based approach to language learning

A corpus-based approach to language learning
Building a large annotated corpus of English: the penn treebank

Computational Linguistics - Special issue on using large corpora: II
A practical part-of-speech tagger

ANLC '92 Proceedings of the third conference on Applied natural language processing
An annotation scheme for free word order languages

ANLC '97 Proceedings of the fifth conference on Applied natural language processing
Improving data driven wordclass tagging by system combination

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1

Learning to lemmatise slovene words

Learning language in logic
Shallow Parsing Using Probabilistic Grammatical Inference

ICGI '02 Proceedings of the 6th International Colloquium on Grammatical Inference: Algorithms and Applications
Statistical Part-of-Speech Tagging for Classical Chinese

TSD '02 Proceedings of the 5th International Conference on Text, Speech and Dialogue
Achieving an Almost Correct PoS-Tagged Corpus

TSD '02 Proceedings of the 5th International Conference on Text, Speech and Dialogue
Rules for Automatic Grapheme-to-Allophone Transcription in Slovene

TDS '00 Proceedings of the Third International Workshop on Text, Speech and Dialogue
The Possibilities of Automatic Detection/Correction of Errors in Tagged Corpora: A Pilot Study on a German Corpus

TSD '01 Proceedings of the 4th International Conference on Text, Speech and Dialogue
Formal Methods of Tokenization for Part-of-Speech Tagging

CICLing '02 Proceedings of the Third International Conference on Computational Linguistics and Intelligent Text Processing
A Common Solution for Tokenization and Part-of-Speech Tagging

TSD '02 Proceedings of the 5th International Conference on Text, Speech and Dialogue
Impact of imperfect OCR on part-of-speech tagging

ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 1
Shallow parsing using specialized hmms

The Journal of Machine Learning Research
Shallow parsing with pos taggers and linguistic features

The Journal of Machine Learning Research
Shallow parsing using noisy and non-stationary training material

The Journal of Machine Learning Research
Lattice-based tagging using support vector machines

CIKM '03 Proceedings of the twelfth international conference on Information and knowledge management
The influence of semantics in IR using LSI and K-means clustering techniques

ISICT '03 Proceedings of the 1st international symposium on Information and communication technologies
Improving accuracy in word class tagging through the combination of machine learning systems

Computational Linguistics
Retrieving NASA problem reports: a case study in natural language information retrieval

Data & Knowledge Engineering - NLDB2002
Improving part-of-speech tagging using lexicalized HMMs

Natural Language Engineering
Multimodal model integration for sentence unit detection

Proceedings of the 6th international conference on Multimodal interfaces
Acquisition of categorized named entities for web search

Proceedings of the thirteenth ACM international conference on Information and knowledge management
Lexicon acquisition with a large-coverage unification-based grammar

EACL '03 Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - Volume 2
How to build a QA system in your back-garden: application for Romanian

EACL '03 Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - Volume 2
A cross-language document retrieval system based on semantic annotation

EACL '03 Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - Volume 2
Investigating GIS and smoothing for maximum entropy taggers

EACL '03 Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - Volume 1
Constraint based integration of deep and shallow parsing techniques

EACL '03 Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - Volume 1
Empirical methods for compound splitting

EACL '03 Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - Volume 1
Combining clues for word alignment

EACL '03 Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - Volume 1
(Semi-)automatic detection of errors in PoS-tagged corpora

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
A stochastic topological parser for German

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Syntactic features for high precision word sense disambiguation

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Annotating topological fields and chunks: and revising POS tags at the same time

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Applying Co-Training to reference resolution

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Revision learning and its application to part-of-speech tagging

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Feature-rich part-of-speech tagging with a cyclic dependency network

NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
Bayesian nets in syntactic categorization of novel words

NAACL-Short '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology: companion volume of the Proceedings of HLT-NAACL 2003--short papers - Volume 2
Probabilistic parsing for German using sister-head dependencies

ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Self-organizing Markov models and their application to part-of-speech tagging

ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Language independent, minimally supervised induction of lexical probabilities

ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
Optimization of word alignment clues

Natural Language Engineering
Using induced rules as complex features in memory-based language learning

ConLL '00 Proceedings of the 2nd workshop on Learning language in logic and the 4th conference on Computational natural language learning - Volume 7
Enriching the knowledge sources used in a maximum entropy part-of-speech tagger

EMNLP '00 Proceedings of the 2000 Joint SIGDAT conference on Empirical methods in natural language processing and very large corpora: held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics - Volume 13
One sense per collocation and genre/topic variations

EMNLP '00 Proceedings of the 2000 Joint SIGDAT conference on Empirical methods in natural language processing and very large corpora: held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics - Volume 13
Multidimensional transformation-based learning

ConLL '01 Proceedings of the 2001 workshop on Computational Natural Language Learning - Volume 7
The TELRI tool catalogue: structure and prospects

STAR '01 Proceedings of the ACL 2001 Workshop on Sharing Tools and Resources - Volume 15
An interactive spreadsheet for teaching the forward-backward algorithm

ETMTNLP '02 Proceedings of the ACL-02 Workshop on Effective tools and methodologies for teaching natural language processing and computational linguistics - Volume 1
Building a hyponymy lexicon with hierarchical structure

ULA '02 Proceedings of the ACL-02 workshop on Unsupervised lexical acquisition - Volume 9
A multilingual approach to disambiguate prepositions and case suffixes

WSD '02 Proceedings of the ACL-02 workshop on Word sense disambiguation: recent successes and future directions - Volume 8
Conditional structure versus conditional estimation in NLP models

EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
The influence of minimum edit distance on reference resolution

EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
Topological field chunking for German

COLING-02 proceedings of the 6th conference on Natural language learning - Volume 20
Learning sequence-to-sequence correspondences from parallel corpora via sequential pattern mining

HLT-NAACL-PARALLEL '03 Proceedings of the HLT-NAACL 2003 Workshop on Building and using parallel texts: data driven machine translation and beyond - Volume 3
Bootstrapping POS taggers using unlabelled data

CONLL '03 Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003 - Volume 4
Named entity recognition using hundreds of thousands of features

CONLL '03 Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003 - Volume 4
Blueprint for a high performance NLP infrastructure

SEALTS '03 Proceedings of the HLT-NAACL 2003 workshop on Software engineering and architecture of language technology systems - Volume 8
Towards supporting on-demand virtual remodularization using program graphs

Proceedings of the 5th international conference on Aspect-oriented software development
Linguistic knowledge in statistical phrase-based word alignment

Natural Language Engineering
ME-based biomedical named entity recognition using lexical knowledge

ACM Transactions on Asian Language Information Processing (TALIP)
Domain-specific language models and lexicons for tagging

Journal of Biomedical Informatics
Tagging of name records for genealogical data browsing

Proceedings of the 6th ACM/IEEE-CS joint conference on Digital libraries
The importance of the lexicon in tagging biological text

Natural Language Engineering
Beyond N in N-gram tagging

ACLstudent '04 Proceedings of the ACL 2004 workshop on Student research
Supersense tagging of unknown nouns using semantic similarity

ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
What to do when lexicalization fails: parsing German with suffix analysis and smoothing

ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Guiding a constraint dependency parser with supertags

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Hybrid parsing: using probabilistic models as predictors for a symbolic parser

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Names and similarities on the web: fact extraction in the fast lane

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Toward unsupervised whole-corpus tagging

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Japanese unknown word identification by character-based chunking

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Chinese and Japanese word segmentation using word-level and character-level information

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Tagging with hidden Markov models using ambiguous tags

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
New statistical methods for phrase break prediction

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
High-performance tagging on medical texts

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Collocation extraction based on modifiability statistics

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
A semantic-based approach to interoperability of classification hierarchies: evaluation of linguistic techniques

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Bidirectional inference with the easiest-first strategy for tagging sequence data

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Ontology based text indexing and querying for the semantic web

Knowledge-Based Systems
Natural language tagging with genetic algorithms

Information Processing Letters
A no-frills architecture for lightweight answer retrieval

Proceedings of the 16th international conference on World Wide Web
Automatic Word Spacing Using Probabilistic Models Based on Character n-grams

IEEE Intelligent Systems
Unsupervised estimation for noisy-channel models

Proceedings of the 24th international conference on Machine learning
Improving statistical MT by coupling reordering and decoding

Machine Translation
Lightweight web-based fact repositories for textual question answering

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
The role of documents vs. queries in extracting class attributes from text

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Efficient interactive query expansion with complete search

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Spoken language annotation and data-driven modelling of phone-level pronunciation in discourse context

Speech Communication
Part-of-speech tagging of modern hebrew text

Natural Language Engineering
Towards temporal web search

Proceedings of the 2008 ACM symposium on Applied computing
Ripple Down Rule learning for automated word lemmatisation

AI Communications
Combining automatic acquisition of knowledge with machine learning approaches for multilingual temporal recognition and normalization

Information Sciences: an International Journal
Part-of-Speech Tagging Based on Machine Translation Techniques

IbPRIA '07 Proceedings of the 3rd Iberian conference on Pattern Recognition and Image Analysis, Part I
Accuracy of Baseline and Complex Methods Applied to Morphosyntactic Tagging of Polish

ICCS '08 Proceedings of the 8th international conference on Computational Science, Part I
Portuguese Part-of-Speech Tagging Using Entropy Guided Transformation Learning

PROPOR '08 Proceedings of the 8th international conference on Computational Processing of the Portuguese Language
A Comparison of Language Models for Dialog Act Segmentation of Meeting Transcripts

TSD '08 Proceedings of the 11th international conference on Text, Speech and Dialogue
On the impact of morphology in English to Spanish statistical MT

Speech Communication
METIS-II: low resource machine translation

Machine Translation
Who Is It? Context Sensitive Named Entity and Instance Recognition by Means of Wikipedia

WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Extracting Dependency Trees from Sanskrit Texts

Proceedings of the 3rd International Symposium on Sanskrit Computational Linguistics
Low-Cost Supervision for Multiple-Source Attribute Extraction

CICLing '09 Proceedings of the 10th International Conference on Computational Linguistics and Intelligent Text Processing
On the Impact of Lexical and Linguistic Features in Genre- and Domain-Based Categorization

CICLing '07 Proceedings of the 8th International Conference on Computational Linguistics and Intelligent Text Processing
HunPos: an open source trigram tagger

ACL '07 Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions
Automatic part-of-speech tagging for Bengali: an approach for morphologically rich languages in a poor resource scenario

ACL '07 Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions
A Video Retrieval System for Computer Assisted Language Learning

Proceedings of the 2005 conference on Artificial Intelligence in Education: Supporting Learning through Intelligent and Socially Informed Technology
Part-of-speech tagging of Northern Sotho: disambiguating polysemous function words

AfLaT '09 Proceedings of the First Workshop on Language Technologies for African Languages
Methods for Amharic part-of-speech tagging

AfLaT '09 Proceedings of the First Workshop on Language Technologies for African Languages
Creating a test corpus of clinical notes manually tagged for part-of-speech information

JNLPBA '04 Proceedings of the International Joint Workshop on Natural Language Processing in Biomedicine and its Applications
Incorporating lexical knowledge into biomedical NE recognition

JNLPBA '04 Proceedings of the International Joint Workshop on Natural Language Processing in Biomedicine and its Applications
Exploiting context for biomedical entity recognition: from syntax to the web

JNLPBA '04 Proceedings of the International Joint Workshop on Natural Language Processing in Biomedicine and its Applications
Using Short Dependency Relations from Auto-Parsed Data for Chinese Dependency Parsing

ACM Transactions on Asian Language Information Processing (TALIP)
Web-derived resources for web information retrieval: from conceptual hierarchies to attribute hierarchies

Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Statistical term profiling for query pattern mining

BioNLP '08 Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing
Towards retrieving relevant information for answering clinical comparison questions

BioNLP '09 Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing
PNEPs, NEPs for Context Free Parsing: Application to Natural Language Processing

IWANN '09 Proceedings of the 10th International Work-Conference on Artificial Neural Networks: Part I: Bio-Inspired Systems: Computational and Ambient Intelligence
Organizing and searching the world wide web of facts - step one: the one-million fact extraction challenge

AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Learning reliable information for dependency parsing adaptation

COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Unsupervised induction of labeled parse trees by clustering with syntactic features

COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Estimation of conditional probabilities with decision trees and an application to fine-grained POS tagging

COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Tagging Portuguese with a Spanish tagger using cognates

CrossLangInduction '06 Proceedings of the International Workshop on Cross-Language Knowledge Induction
The Spanish resource grammar: pre-processing strategy and lexical acquisition

DeepLP '07 Proceedings of the Workshop on Deep Linguistic Processing
A suite of shallow processing tools for Portuguese: LX-suite

EACL '06 Proceedings of the Eleventh Conference of the European Chapter of the Association for Computational Linguistics: Posters & Demonstrations
N-gram-based statistical machine translation versus syntax augmented machine translation: comparison and system combination

EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
Correcting a PoS-tagged corpus using three complementary methods

EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
Tagging Urdu text with parts of speech: a tagger comparison

EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
Semi-supervised training for the averaged perceptron POS tagger

EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
An empirical approach to the interpretation of superlatives

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Using linguistically motivated features for paragraph boundary identification

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Broad-coverage sense disambiguation and information extraction with a supersense sequence tagger

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Feature-based segmentation of narrative documents

FeatureEng '05 Proceedings of the ACL Workshop on Feature Engineering for Machine Learning in Natural Language Processing
Towards domain-independent deep linguistic processing: ensuring portability and re-usability of lexicalised grammars

GEAF '08 Proceedings of the Workshop on Grammar Engineering Across Frameworks
A reconfigurable stochastic tagger for languages with complex tag structure

MorphSlav '03 Proceedings of the 2003 EACL Workshop on Morphological Processing of Slavic Languages
Sentence fusion via dependency graph compression

EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Part-of-speech tagging for English-Spanish code-switched text

EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Tagging Icelandic text using a linguistic and a statistical tagger

NAACL-Short '07 Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers
Improving Word Alignment Using Alignment of Deep Structures

TSD '09 Proceedings of the 12th International Conference on Text, Speech and Dialogue
Deriving a large scale taxonomy from Wikipedia

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 2
Turning web text and search queries into factual knowledge: hierarchical class attribute extraction

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Finding cars, goddesses and enzymes: parametrizable acquisition of labeled instances for open-domain information extraction

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Context-dependent alignment models for statistical machine translation

NAACL '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Unsupervised approaches for automatic keyword extraction using meeting transcripts

NAACL '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Improving a simple bigram HMM part-of-speech tagger by latent annotation and self-training

NAACL-Short '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers
Multi-dimensional annotation and alignment in an English-German translation corpus

NLPXML '06 Proceedings of the 5th Workshop on NLP and XML: Multi-Dimensional Markup in Natural Language Processing
Part-of-speech tagging with a symbolic full parser: using the TIGER treebank to evaluate Fips

PaGe '08 Proceedings of the Workshop on Parsing German
POS tagging of dialectal Arabic: a minimally supervised approach

Semitic '05 Proceedings of the ACL Workshop on Computational Approaches to Semitic Languages
Using shallow syntax information to improve word alignment and reordering for SMT

StatMT '08 Proceedings of the Third Workshop on Statistical Machine Translation
The TALP-UPC Ngram-based statistical machine translation system for ACL-WMT 2008

StatMT '08 Proceedings of the Third Workshop on Statistical Machine Translation
Phrase-based and deep syntactic English-to-Czech statistical machine translation

StatMT '08 Proceedings of the Third Workshop on Statistical Machine Translation
Syntax-oriented evaluation measures for machine translation output

StatMT '09 Proceedings of the Fourth Workshop on Statistical Machine Translation
The RWTH machine translation system for WMT 2009

StatMT '09 Proceedings of the Fourth Workshop on Statistical Machine Translation
A core-tools statistical NLP course

TeachNLP '05 Proceedings of the Second ACL Workshop on Effective Tools and Methodologies for Teaching Natural Language Processing and Computational Linguistics
Web-based frequency dictionaries for medium density languages

WAC '06 Proceedings of the 2nd International Workshop on Web as Corpus
Hybrid methods for POS guessing of Chinese unknown words

ACLstudent '05 Proceedings of the ACL Student Research Workshop
Jointly labeling multiple sequences: a factorial HMM approach

ACLstudent '05 Proceedings of the ACL Student Research Workshop
Phrase linguistic classification and generalization for improving statistical machine translation

ACLstudent '05 Proceedings of the ACL Student Research Workshop
An unsupervised system for identifying English inclusions in German text

ACLstudent '05 Proceedings of the ACL Student Research Workshop
Developing online ICALL exercises for Russian

EANL '08 Proceedings of the Third Workshop on Innovative Use of NLP for Building Educational Applications
Automatic knowledge representation using a graph-based algorithm for language-independent lexical chaining

IEBeyondDoc '06 Proceedings of the Workshop on Information Extraction Beyond The Document
A measure of aggregate syntactic distance

LD '06 Proceedings of the Workshop on Linguistic Distances
Active learning for part-of-speech tagging: accelerating corpus annotation

LAW '07 Proceedings of the Linguistic Annotation Workshop
A two-stage method for active learning of statistical grammars

IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Induction of fine-grained part-of-speech taggers via classifier combination and crosslingual projection

ParaText '05 Proceedings of the ACL Workshop on Building and Using Parallel Texts
TALP phrase-based statistical translation system for European language pairs

StatMT '06 Proceedings of the Workshop on Statistical Machine Translation
N-gram-based SMT system enhanced with reordering patterns

StatMT '06 Proceedings of the Workshop on Statistical Machine Translation
Finding hedges by chasing weasels: hedge detection using Wikipedia tags and shallow linguistic features

ACLShort '09 Proceedings of the ACL-IJCNLP 2009 Conference Short Papers
LX-Center: a center of online linguistic services

ACLDemos '09 Proceedings of the ACL-IJCNLP 2009 Software Demonstrations
Analysis and development of Urdu POS tagged corpus

ALR7 Proceedings of the 7th Workshop on Asian Language Resources
Construction of a German HPSG grammar from a detailed treebank

GEAF '09 Proceedings of the 2009 Workshop on Grammar Engineering Across Frameworks
By all these lovely tokens...: merging conflicting tokenizations

ACL-IJCNLP '09 Proceedings of the Third Linguistic Annotation Workshop
Mining search engine clickthrough log for matching N-gram features

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
Improving dependency parsing with subtrees from auto-parsed data

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
On statistical parsing of French with supervised and semi-supervised strategies

CLAGI '09 Proceedings of the EACL 2009 Workshop on Computational Linguistic Aspects of Grammatical Inference
Exploiting aligned parallel corpora in multilingual studies and applications

IWIC'07 Proceedings of the 1st international conference on Intercultural collaboration
A support vector machine approach to dutch part-of-speech tagging

IDA'07 Proceedings of the 7th international conference on Intelligent data analysis
Morpho-syntactic post-processing of N-best lists for improved French automatic speech recognition

Computer Speech and Language
A hidden Markov model based named entity recognition system: Bengali and Hindi as case studies

PReMI'07 Proceedings of the 2nd international conference on Pattern recognition and machine intelligence
XML rules for enclitic segmentation

EUROCAST'07 Proceedings of the 11th international conference on Computer aided systems theory
Why don't Romanians have a five o'clock tea, Nor Halloween, but have a kind of Valentines day?

CICLing'08 Proceedings of the 9th international conference on Computational linguistics and intelligent text processing
Performance analysis of a part of speech tagging task

CICLing'03 Proceedings of the 4th international conference on Computational linguistics and intelligent text processing
Using ILP to construct features for information extraction from semi-structured text

ILP'07 Proceedings of the 17th international conference on Inductive logic programming
Automated identification of LTL patterns in natural language requirements

ISSRE'09 Proceedings of the 20th IEEE international conference on software reliability engineering
Improving Persian information retrieval systems using stemming and part of speech tagging

CLEF'08 Proceedings of the 9th Cross-language evaluation forum conference on Evaluating systems for multilingual and multimodal information access
Acquisition of instance attributes via labeled and related instances

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
An Information-Extraction System for Urdu---A Resource-Poor Language

ACM Transactions on Asian Language Information Processing (TALIP)
Wordica: Emergence of linguistic representations for words by independent component analysis

Natural Language Engineering
On automated evaluation of readability of summaries: capturing grammaticality, focus, structure and coherence

HLT-SRWS '10 Proceedings of the NAACL HLT 2010 Student Research Workshop
Efficient staggered decoding for sequence labeling

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Creating robust supervised classifiers via web-scale N-gram data

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Profiting from mark-up: hyper-text annotations for guided parsing

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Coreference resolution across corpora: languages, coding schemes, and preprocessing information

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Issues on quality assessment of SNOMED CT® subsets: term validation and term extraction

WBIE '09 Proceedings of the Workshop on Biomedical Information Extraction
Deriving clinical query patterns from medical corpora using domain ontologies

WBIE '09 Proceedings of the Workshop on Biomedical Information Extraction
Unsupervised Part-of-Speech Tagging in the Large

Research on Language and Computation
Variable-length Markov models and ambiguous words in Portuguese

YIWCALA '10 Proceedings of the NAACL HLT 2010 Young Investigators Workshop on Computational Approaches to Languages of the Americas
Variable-length Markov models and ambiguous words in Portuguese

YIWCALA '10 Proceedings of the NAACL HLT 2010 Young Investigators Workshop on Computational Approaches to Languages of the Americas
Challenges of cheap resource creation for morphological tagging

LAW IV '10 Proceedings of the Fourth Linguistic Annotation Workshop
Using collocation segmentation to augment the phrase table

WMT '10 Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR
The CUED HiFST system for the WMT10 translation shared task

WMT '10 Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR
Resolving speculation: MaxEnt cue classification and dependency-based scope rules

CoNLL '10: Shared Task Proceedings of the Fourteenth Conference on Computational Natural Language Learning --- Shared Task
Crouching Dirichlet, hidden Markov model: unsupervised POS tagging with context local tag generation

EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Uptraining for accurate deterministic question parsing

EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Feature selection for fluency ranking

INLG '10 Proceedings of the 6th International Natural Language Generation Conference
Generating learner-like morphological errors in Russian

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Clustering polish texts with latent semantic analysis

ICAISC'10 Proceedings of the 10th international conference on Artifical intelligence and soft computing: Part II
Improving the accessibility of ASCII graphics for the blind students: producing my own graphics

ICCHP'10 Proceedings of the 12th international conference on Computers helping people with special needs
Application of stacked methods to part-of-speech tagging of polish

PPAM'09 Proceedings of the 8th international conference on Parallel processing and applied mathematics: Part I
Part-of-speech tagging using parallel weighted finite-state transducers

IceTAL'10 Proceedings of the 7th international conference on Advances in natural language processing
Information extraction from concise passages of natural language sources

ADBIS'10 Proceedings of the 14th east European conference on Advances in databases and information systems
TALP at GikiCLEF 2009

CLEF'09 Proceedings of the 10th cross-language evaluation forum conference on Multilingual information access evaluation: text retrieval experiments
The application of structured learning in natural language processing

Machine Translation
Adding morphological information to a connectionist part-of-speech tagger

CAEPIA'09 Proceedings of the Current topics in artificial intelligence, and 13th conference on Spanish association for artificial intelligence
Automatic part of speech tagging for Arabic: an experiment using Bigram hidden Markov model

RSKT'10 Proceedings of the 5th international conference on Rough set and knowledge technology
Improving hierarchical document signature performance by classifier combination

ICONIP'10 Proceedings of the 17th international conference on Neural information processing: theory and algorithms - Volume Part I
A comparison of unsupervised methods for part-of-speech tagging in Chinese

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Verbs are where all the action lies: experiences of shallow parsing of a morphologically rich language

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
The role of queries in ranking labeled instances extracted from text

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
A multi-domain web-based algorithm for POS tagging of unknown words

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Towards better ontological support for recognizing textual entailment

EKAW'10 Proceedings of the 17th international conference on Knowledge engineering and management by the masses
Tamil dependency parsing: results using rule based and corpus based approaches

CICLing'11 Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part I
Dependency syntax analysis using grammar induction and a lexical categories precedence system

CICLing'11 Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part I
Ripple down rules for part-of-speech tagging

CICLing'11 Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part I
An efficient part-of-speech tagger for arabic

CICLing'11 Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part I
Taxonomy induction based on a collaboratively built knowledge repository

Artificial Intelligence
Developing a competitive HMM arabic POS tagger using small training corpora

ACIIDS'11 Proceedings of the Third international conference on Intelligent information and database systems - Volume Part I
Improving arabic part-of-speech tagging through morphological analysis

ACIIDS'11 Proceedings of the Third international conference on Intelligent information and database systems - Volume Part I
Recursive alignment block classification technique for word reordering in statistical machine translation

Language Resources and Evaluation
The ACL Anthology Searchbench

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: Systems Demonstrations
Unsupervised part-of-speech tagging with bilingual graph-based projections

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Piggyback: using search engines for robust cross-domain named entity recognition

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Recovering semantics of tables on the web

Proceedings of the VLDB Endowment
A gold standard corpus of early modern German

LAW V '11 Proceedings of the 5th Linguistic Annotation Workshop
MWU-aware part-of-speech tagging with a CRF model and lexical resources

MWE '11 Proceedings of the Workshop on Multiword Expressions: from Parsing and Generation to the Real World
A comparative study of classifier combination methods applied to NLP tasks

NLDB'11 Proceedings of the 16th international conference on Natural language processing and information systems
Adapting a WSJ trained part-of-speech tagger to noisy text: preliminary results

Proceedings of the 2011 Joint Workshop on Multilingual OCR and Analytics for Noisy Unstructured Text Data
Probabilistic Grammars and Languages

Journal of Logic, Language and Information
Cross-Domain Effects on Parse Selection for Precision Grammars

Research on Language and Computation
Advances in deep parsing of scholarly paper content

NLP4DL'09/AT4DL'09 Proceedings of the 2009 international conference on Advanced language technologies for digital libraries
A token centric part-of-speech tagger for biomedical text

AIME'11 Proceedings of the 13th conference on Artificial intelligence in medicine
Exploring a corpus-based approach for detecting language impairment in monolingual English-speaking children

Artificial Intelligence in Medicine
Asking what no one has asked before: using phrase similarities to generate synthetic web search queries

Proceedings of the 20th ACM international conference on Information and knowledge management
On the road to high-quality POS-tagging

KI'05 Proceedings of the 28th annual German conference on Advances in Artificial Intelligence
Language modelling for the needs of OCR of medical texts

ISBMDA'06 Proceedings of the 7th international conference on Biological and Medical Data Analysis
Improving statistical word alignments with morpho-syntactic transformations

FinTAL'06 Proceedings of the 5th international conference on Advances in Natural Language Processing
Supervised textrank

FinTAL'06 Proceedings of the 5th international conference on Advances in Natural Language Processing
Tagging a morphologically complex language using heuristics

FinTAL'06 Proceedings of the 5th international conference on Advances in Natural Language Processing
Developing a robust part-of-speech tagger for biomedical text

PCI'05 Proceedings of the 10th Panhellenic conference on Advances in Informatics
Mining paraphrases from self-anchored web sentence fragments

PKDD'05 Proceedings of the 9th European conference on Principles and Practice of Knowledge Discovery in Databases
The GeoTALP-IR system at GeoCLEF 2005: experiments using a QA-Based IR system, linguistic analysis, and a geographical thesaurus

CLEF'05 Proceedings of the 6th international conference on Cross-Language Evalution Forum: accessing Multilingual Information Repositories
Extending the tool, or how to annotate historical language varieties

LaTeCH '11 Proceedings of the 5th ACL-HLT Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities
A low-budget tagger for Old Czech

LaTeCH '11 Proceedings of the 5th ACL-HLT Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities
Evaluating an 'off-the-shelf' POS-tagger on early modern German text

LaTeCH '11 Proceedings of the 5th ACL-HLT Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities
Automatic verb extraction from historical Swedish texts

LaTeCH '11 Proceedings of the 5th ACL-HLT Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities
Automatic prosodic event detection using a novel labeling and selection method in co-training

Speech Communication
Comparing two markov methods for part-of-speech tagging of portuguese

IBERAMIA-SBIA'06 Proceedings of the 2nd international joint conference, and Proceedings of the 10th Ibero-American Conference on AI 18th Brazilian conference on Advances in Artificial Intelligence
Experiments in cross-language morphological annotation transfer

CICLing'06 Proceedings of the 7th international conference on Computational Linguistics and Intelligent Text Processing
A sentence compression module for machine-assisted subtitling

CICLing'06 Proceedings of the 7th international conference on Computational Linguistics and Intelligent Text Processing
Investigating the best configuration of HMM spanish pos tagger when minimum amount of training data is available

NLDB'05 Proceedings of the 10th international conference on Natural Language Processing and Information Systems
Unsupervised evaluation of parser robustness

CICLing'05 Proceedings of the 6th international conference on Computational Linguistics and Intelligent Text Processing
Distributional thesaurus versus wordnet: a comparison of backoff techniques for unsupervised PP attachment

CICLing'05 Proceedings of the 6th international conference on Computational Linguistics and Intelligent Text Processing
Finding instance names and alternative glosses on the web: wordnet reloaded

CICLing'05 Proceedings of the 6th international conference on Computational Linguistics and Intelligent Text Processing
Applying stacking and corpus transformation to a chunking task

EUROCAST'05 Proceedings of the 10th international conference on Computer Aided Systems Theory
Training a parser for machine translation reordering

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Aligning needles in a haystack: paraphrase acquisition across the web

IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
Two-phase biomedical named entity recognition using a hybrid method

IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
Automatically inducing a part-of-speech tagger by projecting from multiple source languages across aligned corpora

IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
Inductive improvement of part-of-speech tagging and its effect on a terminology of molecular biology

AI'05 Proceedings of the 18th Canadian Society conference on Advances in Artificial Intelligence
Voting between multiple data representations for text chunking

AI'05 Proceedings of the 18th Canadian Society conference on Advances in Artificial Intelligence
Data-Driven part-of-speech tagging of kiswahili

TSD'06 Proceedings of the 9th international conference on Text, Speech and Dialogue
Making stone soup: evaluating a recall-oriented multi-stream question answering system for dutch

CLEF'04 Proceedings of the 5th conference on Cross-Language Evaluation Forum: multilingual Information Access for Text, Speech and Images
COLE experiments at QA@CLEF 2004 spanish monolingual track

CLEF'04 Proceedings of the 5th conference on Cross-Language Evaluation Forum: multilingual Information Access for Text, Speech and Images
Improving POS tagging for ungrammatical phrases

Proceedings of the 2012 Joint International Conference on Human-Centered Computer Environments
Dedicated nominal featurization of portuguese

PROPOR'06 Proceedings of the 7th international conference on Computational Processing of the Portuguese Language
DILUCT: an open-source spanish dependency parser based on rules, heuristics, and selectional preferences

NLDB'06 Proceedings of the 11th international conference on Applications of Natural Language to Information Systems
A ubiquitous agent for unrestricted vocabulary learning in noisy digital environments

ITS'06 Proceedings of the 8th international conference on Intelligent Tutoring Systems
Unsupervised part-of-speech disambiguation for high frequency words and its influence on unsupervised parsing

CICLing'10 Proceedings of the 11th international conference on Computational Linguistics and Intelligent Text Processing
Quantitative evaluation of grammaticality of summaries

CICLing'10 Proceedings of the 11th international conference on Computational Linguistics and Intelligent Text Processing
Syntactic language modeling with formal grammars

Speech Communication
By all these lovely tokens... Merging conflicting tokenizations

Language Resources and Evaluation
CuteForce: deep deterministic HPSG parsing

IWPT '11 Proceedings of the 12th International Conference on Parsing Technologies
The latent words language model

Computer Speech and Language
Predictive text entry for agglutinative languages using unsupervised morphological segmentation

CICLing'12 Proceedings of the 13th international conference on Computational Linguistics and Intelligent Text Processing - Volume Part II
Verb analysis in a highly inflective language with an MFF algorithm

PROPOR'12 Proceedings of the 10th international conference on Computational Processing of the Portuguese Language
An automated method for identifying TRIZ evolution trends from patents

Expert Systems with Applications: An International Journal
Aircraft interior failure pattern recognition utilizing text mining and neural networks

Journal of Intelligent Information Systems
Sibyl, a factoid question-answering system for spoken documents

ACM Transactions on Information Systems (TOIS)
Speculation and negation: Rules, rankers, and the role of syntax

Computational Linguistics
Subject and object identification in Malayalam text

Proceedings of the International Conference on Advances in Computing, Communications and Informatics
Experiments on POS tagging and data driven dependency parsing for Telugu language

Proceedings of the International Conference on Advances in Computing, Communications and Informatics
Feature-rich part-of-speech tagging for morphologically complex languages: application to Bulgarian

EACL '12 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
The effects of semantic annotations on precision parse ranking

SemEval '12 Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation
Full machine translation for factoid question answering

EACL 2012 Proceedings of the Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra)
Parsing the past: identification of verb constructions in historical text

LaTeCH '12 Proceedings of the 6th Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities
Unsupervised dependency parsing using reducibility and fertility features

WILS '12 Proceedings of the NAACL-HLT Workshop on the Induction of Linguistic Structure
Extracting glossary sentences from scholarly articles: a comparative evaluation of pattern bootstrapping and deep analysis

ACL '12 Proceedings of the ACL-2012 Special Workshop on Rediscovering 50 Years of Discoveries
A cost sensitive part-of-speech tagging: differentiating serious errors from minor errors

ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
Exploiting reducibility in unsupervised dependency parsing

EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Exact sampling and decoding in high-order hidden Markov models

EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
TALP at GeoCLEF 2006: experiments Using JIRS and Lucene with the ADL feature type thesaurus

CLEF'06 Proceedings of the 7th international conference on Cross-Language Evaluation Forum: evaluation of multilingual and multi-modal information retrieval
Automatic Korean word spacing using Pegasos algorithm

Information Processing and Management: an International Journal
Part of speech tagging for arabic

Natural Language Engineering
Rule-Based morphological tagger for an inflectional language

COST'11 Proceedings of the 2011 international conference on Cognitive Behavioural Systems
Combining multiple classifiers using vote based classifier ensemble technique for named entity recognition

Data & Knowledge Engineering
A comparative study of classifier combination applied to NLP tasks

Information Fusion
Full Length Article: Simulated annealing based classifier ensemble techniques: Application to part of speech tagging

Information Fusion
Knowledge sources for constituent parsing of german, a morphologically rich and less-configurational language

Computational Linguistics
Stacked ensemble coupled with feature selection for biomedical entity extraction

Knowledge-Based Systems
Statistical machine translation enhancements through linguistic levels: A survey

ACM Computing Surveys (CSUR)
Incremental, predictive parsing with psycholinguistically motivated tree-adjoining grammar

Computational Linguistics
Dealing with orthographic variation in a tagger-lemmatizer for fourteenth century Dutch charters

Language Resources and Evaluation
Evaluating and automating the annotation of a learner corpus

Language Resources and Evaluation

Quantified Score

Hi-index	0.00

Visualization

Abstract

Trigrams'n'Tags (TnT) is an efficient statistical part-of-speech tagger. Contrary to claims found elsewhere in the literature, we argue that a tagger based on Markov models performs at least as well as other current approaches, including the Maximum Entropy framework. A recent comparison has even shown that TnT performs significantly better for the tested corpora. We describe the basic model of TnT, the techniques used for smoothing and for handling unknown words. Furthermore, we present evaluations on two corpora.