Cross-Language Retrieval for the CLEF Collections - Comparing Multiple Methods of Retrieval
CLEF '00 Revised Papers from the Workshop of Cross-Language Evaluation Forum on Cross-Language Information Retrieval and Evaluation
Accurate methods for the statistics of surprise and coincidence
Computational Linguistics - Special issue on using large corpora: I
A simple rule-based part of speech tagger
ANLC '92 Proceedings of the third conference on Applied natural language processing
Exploiting a controlled vocabulary to improve collection selection and retrieval effectiveness
Proceedings of the tenth international conference on Information and knowledge management
Harvesting translingual vocabulary mappings for multilingual digital libraries
Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries
Translingual vocabulary mappings for multilingual information access
SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Exploiting Manual Indexing to Improve Collection Selection and Retrieval Effectiveness
Information Retrieval
Cross-Language Access to Recorded Speech in the MALACH Project
TSD '02 Proceedings of the 5th International Conference on Text, Speech and Dialogue
Grouping of TRIZ Inventive Principles to facilitate automatic patent classification
Expert Systems with Applications: An International Journal
Logistic Regression and EVIs for XML Books and the Heterogeneous Track
Focused Access to XML Documents
Back to basics - again - for domain-specific retrieval
CLEF'08 Proceedings of the 9th Cross-language evaluation forum conference on Evaluating systems for multilingual and multimodal information access
Sub-Word Indexing and Blind Relevance Feedback for English, Bengali, Hindi, and Marathi IR
ACM Transactions on Asian Language Information Processing (TALIP)
Document expansion, query translation and language modeling for ad-hoc IR
CLEF'09 Proceedings of the 10th cross-language evaluation forum conference on Multilingual information access evaluation: text retrieval experiments
Moving towards adaptive search in digital libraries
NLP4DL'09/AT4DL'09 Proceedings of the 2009 international conference on Advanced language technologies for digital libraries
A baseline for NLP in domain-specific IR
CLEF'05 Proceedings of the 6th international conference on Cross-Language Evalution Forum: accessing Multilingual Information Repositories
Reranking documents with antagonistic terms
CLEF'06 Proceedings of the 7th international conference on Cross-Language Evaluation Forum: evaluation of multilingual and multi-modal information retrieval
Hi-index | 0.00 |
This paper describes a search technology which enables improved search across diverse genres of digital objects -- documents, patents, cross-language retrieval, numeric data and images. The technology leverages human indexing of objects in specialized domains to provide increased accessibility to non-expert searchers. Our approach is the reverse-engineer text categorization to supply mappings from ordinary language vocabulary to specialist vocabulary by constructing maximum likelihood mappings between words and phrases and classification schemes. This forms the training data or 'entry vocabulary'; subsequently user queries are matched against the entry vocabulary to expand the search universe. The technology has been applied to search of patent databases, numeric economic statistics, and foreign language document collections.