Word extraction from corpora and its part-of-speech estimation using distributional analysis

Authors:
Shinsuke Mori;Makoto Nagao
Affiliations:
Kyoto University, Kyoto, Japan;Kyoto University, Kyoto, Japan
Venue:
COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 2
Year:
1996

Citing 2
Cited 9

Distributional part-of-speech tagging

EACL '95 Proceedings of the seventh conference on European chapter of the Association for Computational Linguistics
Overview of the fifth DARPA speech and natural language workshop

HLT '91 Proceedings of the workshop on Speech and Natural Language

A part of speech estimation method for Japanese unknown words using a statistical model of morphology and context

ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Morphological analysis of the spontaneous speech corpus

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 2
Morphological analysis of a large spontaneous speech corpus in Japanese

ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Guessing parts-of-speech of unknown words using global information

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Japanese unknown word identification by character-based chunking

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Online acquisition of Japanese unknown morphemes using morphological constraints

EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Morphological annotation of a large spontaneous speech corpus in Japanese

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Unsupervised Text Normalization Approach for Morphological Analysis of Blog Documents

AI '09 Proceedings of the 22nd Australasian Joint Conference on Advances in Artificial Intelligence
Semantic classification of automatically acquired nouns using lexico-syntactic clues

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters

Quantified Score

Hi-index	0.00

Visualization

Abstract

Unknown words are inevitable at any step of analysis in natural language processing. We propose a method to extract words from a corpus and estimate the probability that each word belongs to given parts of speech (POSs), using a distributional analysis. Our experiments have shown that this method is effective for inferring the POS of unknown words.