We present a framework for mining synonymous transliterations from a set of Web pages collected via a search engine. An integrated statistical measure is proposed for forming the search keywords that are submitted to the search engine to retrieve relevant Web snippets. To identify synonymous transliterations among the retrieved candidates, we compare the similarity between pairs of transliterations. Experimental results show that, on average, about 5.04 synonymous transliterations are harvested per input transliteration. The retrieved results could be beneficial for constructing ontologies, especially in the domain of foreign person names.
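The similarity-comparison step can be illustrated with a minimal sketch. The abstract does not specify the similarity scheme, so the code below substitutes a normalized edit-distance score over romanized strings as a stand-in; it is not the measure used in the framework. The function names, the example strings, and the 0.6 threshold are all hypothetical.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance computed with a rolling one-row DP table."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def similarity(a: str, b: str) -> float:
    """Stand-in score in [0, 1]: 1 minus length-normalized edit distance."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - edit_distance(a, b) / longest


def filter_candidates(query, candidates, threshold=0.6):
    """Keep candidates scoring at or above the (assumed) threshold, best first."""
    scored = [(c, similarity(query, c)) for c in candidates if c != query]
    kept = [pair for pair in scored if pair[1] >= threshold]
    return sorted(kept, key=lambda pair: -pair[1])


if __name__ == "__main__":
    # Hypothetical candidate strings extracted from retrieved snippets.
    for name, score in filter_candidates("obama", ["obama", "obamma", "xyz"]):
        print(name, round(score, 3))
```

A phonetic or character-level measure tuned to the target script would likely replace `similarity` in practice; only the thresholded filtering pattern is meant to carry over.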