A hidden Markov model based named entity recognition system: Bengali and Hindi as case studies
PReMI'07: Proceedings of the 2nd International Conference on Pattern Recognition and Machine Intelligence
This paper addresses two problems with toponym extraction and disambiguation. First, almost no existing work examines the interdependency between extraction and disambiguation. Second, existing disambiguation techniques mostly take extracted toponyms as input without considering the uncertainty and imperfection of the extraction process. This paper aims to investigate both issues and to show that explicitly handling annotation uncertainty has considerable potential for making both extraction and disambiguation more robust.