Broad-coverage sense disambiguation and information extraction with a supersense sequence tagger

Authors: Massimiliano Ciaramita; Yasemin Altun
Affiliations: Italian National Research Council; Toyota Technological Institute at Chicago
Venue: EMNLP '06: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Year: 2006

Abstract

In this paper we approach word sense disambiguation and information extraction as a unified tagging problem. The task consists of annotating text with the tagset defined by the 41 WordNet supersense classes for nouns and verbs. Since the tagset is directly related to WordNet synsets, the tagger returns partial word sense disambiguation. Furthermore, since the noun tags include the standard named entity detection classes (person, location, organization, time, etc.), the tagger, as a by-product, returns extended named entity information. We cast the problem of supersense tagging as a sequential labeling task and investigate it empirically with a discriminatively trained Hidden Markov Model. Experimental evaluation on the main sense-annotated datasets available, i.e., SemCor and Senseval, shows considerable improvements over the best known "first-sense" baseline.
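
One way to picture the sequential-labeling formulation in the abstract is a first-order tagger whose weights are learned with perceptron updates and decoded with Viterbi search. The sketch below is not the authors' implementation. The feature templates, the small tag subset, the toy sentences and every function name are assumptions made for illustration, and refinements such as parameter averaging are left out.

```python
from collections import defaultdict

# Assumed tag subset: "O" plus a few WordNet-style supersense labels.
TAGS = ["O", "noun.person", "noun.location", "verb.motion", "verb.communication"]

# Invented toy training data, for illustration only.
DATA = [
    (["Mary", "flew", "to", "Chicago"],
     ["noun.person", "verb.motion", "O", "noun.location"]),
    (["John", "called", "Mary"],
     ["noun.person", "verb.communication", "noun.person"]),
]


def features(words, i, prev_tag, tag):
    """Hypothetical feature map: emission-like and transition-like indicators."""
    w = words[i].lower()
    return [
        ("word", w, tag),
        ("suffix3", w[-3:], tag),
        ("cap", words[i][0].isupper(), tag),
        ("trans", prev_tag, tag),
    ]


def local_score(weights, words, i, prev_tag, tag):
    return sum(weights.get(f, 0.0) for f in features(words, i, prev_tag, tag))


def viterbi(words, tags, weights):
    """Return the highest-scoring tag sequence under the first-order features."""
    if not words:
        return []
    n = len(words)
    score = [{} for _ in range(n)]
    back = [{} for _ in range(n)]
    for t in tags:
        score[0][t] = local_score(weights, words, 0, "<s>", t)
    for i in range(1, n):
        for t in tags:
            best_prev, best = None, float("-inf")
            for p in tags:
                s = score[i - 1][p] + local_score(weights, words, i, p, t)
                if s > best:
                    best_prev, best = p, s
            score[i][t] = best
            back[i][t] = best_prev
    path = [max(tags, key=lambda t: score[n - 1][t])]
    for i in range(n - 1, 0, -1):
        path.append(back[i][path[-1]])
    return path[::-1]


def update(weights, words, seq, delta):
    """Add delta to the weight of every feature fired by the tag sequence."""
    prev = "<s>"
    for i, tag in enumerate(seq):
        for f in features(words, i, prev, tag):
            weights[f] += delta
        prev = tag


def train(data, tags, epochs=20):
    """Plain (unaveraged) structured perceptron; stops after an error-free epoch."""
    weights = defaultdict(float)
    for _ in range(epochs):
        mistakes = 0
        for words, gold in data:
            pred = viterbi(words, tags, weights)
            if pred != gold:
                update(weights, words, gold, +1.0)  # promote gold features
                update(weights, words, pred, -1.0)  # demote predicted features
                mistakes += 1
        if mistakes == 0:
            break
    return weights


if __name__ == "__main__":
    w = train(DATA, TAGS)
    print(viterbi(["Mary", "called", "John"], TAGS, w))
```

Training is discriminative in the sense that the weights change only when the decoded sequence differs from the gold one, while decoding keeps the HMM-style factorization into per-word and tag-transition scores.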