Performing Adaptive Morphological Analysis Using Internet Resources

  • Authors:
  • Marek Trabalka;Mária Bieliková

  • Affiliations:
  • -;-

  • Venue:
  • TSD '99 Proceedings of the Second International Workshop on Text, Speech and Dialogue
  • Year:
  • 1999

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we describe an approach to an adaptive morphological analysis based on lexicon corpus acquired from Internet. We focus on automating categorization words into a morphological paradigm in flexive languages. It is done by inducing possible word forms using morphological knowledge base and by looking for word forms of possible inflections in a morphological lexicon. We developed a prototype system based on the proposed approach. Our system is general (it respects language but it performs better on a flexive language). We tested the system for the Slovak language. System's lexicon is built by means of browsing Internet pages. Parsed texts, recognized to be written in Slovak, are used to establish database of Slovak words with their frequencies in texts.