Representing text chunks

Authors:
Erik F. Tjong Kim Sang;Jorn Veenstra
Affiliations:
University of Antwerp, Wilrijk, Belgium;Tilburg University, Tilburg, The Netherlands
Venue:
EACL '99 Proceedings of the ninth conference on European chapter of the Association for Computational Linguistics
Year:
1999

Abstract

Dividing sentences into chunks of words is a useful preprocessing step for parsing, information extraction and information retrieval. Ramshaw and Marcus (1995) introduced a "convenient" data representation for chunking by converting it to a tagging task. In this paper we examine seven different data representations for the problem of recognizing noun phrase chunks. We show that the choice of data representation has a minor influence on chunking performance. However, equipped with the most suitable data representation, our memory-based learning chunker was able to improve on the best published chunking results for a standard data set.
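
To illustrate what converting chunking to a tagging task can look like, the following is a minimal sketch of two widely used inside/outside encodings, commonly called IOB2 and IOB1. The abstract does not list the seven representations it compares, so the choice of these two, the function names, and the example sentence are assumptions made for illustration only. In IOB2 every chunk starts with a B tag; in IOB1 a B tag is used only when a chunk immediately follows another chunk.

```python
from typing import List, Tuple


def spans_to_iob2(n_tokens: int, spans: List[Tuple[int, int]]) -> List[str]:
    """Encode chunk spans (start inclusive, end exclusive) as IOB2 tags.

    Hypothetical helper: every chunk-initial token gets B, the remaining
    chunk tokens get I, and tokens outside any chunk get O.
    """
    tags = ["O"] * n_tokens
    for start, end in spans:
        tags[start] = "B"
        for i in range(start + 1, end):
            tags[i] = "I"
    return tags


def iob2_to_iob1(tags: List[str]) -> List[str]:
    """Convert IOB2 tags to IOB1 tags.

    In IOB1, B is kept only when the previous token also belongs to a
    chunk, that is, when two chunks are directly adjacent.
    """
    out = []
    prev = "O"  # IOB2 tag of the previous token; sentence start counts as O
    for tag in tags:
        if tag == "B" and prev == "O":
            out.append("I")
        else:
            out.append(tag)
        prev = tag
    return out


# Illustrative sentence with four noun phrase chunks (assumed example).
tokens = ["In", "early", "trading", "in", "Hong", "Kong", "Monday", ",", "gold"]
spans = [(1, 3), (4, 6), (6, 7), (8, 9)]

iob2 = spans_to_iob2(len(tokens), spans)
iob1 = iob2_to_iob1(iob2)
for token, t2, t1 in zip(tokens, iob2, iob1):
    print(f"{token}\t{t2}\t{t1}")
```

Both encodings describe the same chunks. They differ only in which tokens carry the boundary information, and that difference is the kind of representation choice the abstract refers to.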