Large lexicons for natural language processing: utilising the grammar coding system of LDOCE
Computational Linguistics - Special issue of the lexicon
Experiments on incorporating syntactic processing of user queries into a document retrieval strategy
SIGIR '88 Proceedings of the 11th annual international ACM SIGIR conference on Research and development in information retrieval
A french text recognition model for information retrieval system
SIGIR '88 Proceedings of the 11th annual international ACM SIGIR conference on Research and development in information retrieval
Coefficients of combining concept classes in a collection
SIGIR '88 Proceedings of the 11th annual international ACM SIGIR conference on Research and development in information retrieval
A cluster-based approach to thesaurus construction
SIGIR '88 Proceedings of the 11th annual international ACM SIGIR conference on Research and development in information retrieval
The effectiveness of a nonsyntatic approach to automatic phrase indexing for document retrieval
Journal of the American Society for Information Science
Word sense disambiguation using machine-readable dictionaries
SIGIR '89 Proceedings of the 12th annual international ACM SIGIR conference on Research and development in information retrieval
On the application of syntactic methodologies in automatic text analysis
SIGIR '89 Proceedings of the 12th annual international ACM SIGIR conference on Research and development in information retrieval
Information Processing and Management: an International Journal
Inference networks for document retrieval
SIGIR '90 Proceedings of the 13th annual international ACM SIGIR conference on Research and development in information retrieval
Experiments with query acquisition and use in document retrieval systems
SIGIR '90 Proceedings of the 13th annual international ACM SIGIR conference on Research and development in information retrieval
Representation and learning in information retrieval
Representation and learning in information retrieval
Proceedings of the 9th annual international ACM SIGIR conference on Research and development in information retrieval
Optimization of inverted vector searches
SIGIR '85 Proceedings of the 8th annual international ACM SIGIR conference on Research and development in information retrieval
Computer Evaluation of Indexing and Text Processing
Journal of the ACM (JACM)
A news story categorization system
ANLC '88 Proceedings of the second conference on Applied natural language processing
Automatic Information Organization and Retrieval.
Automatic Information Organization and Retrieval.
ACM Transactions on Information Systems (TOIS)
Fast text processing for information retrieval
HLT '91 Proceedings of the workshop on Speech and Natural Language
Representation quality in text classification: an introduction and experiment
HLT '90 Proceedings of the workshop on Speech and Natural Language
The use of phrases and structured queries in information retrieval
SIGIR '91 Proceedings of the 14th annual international ACM SIGIR conference on Research and development in information retrieval
An evaluation of phrasal and clustered representations on a text categorization task
SIGIR '92 Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval
Use of syntactic context to produce term association lists for text retrieval
SIGIR '92 Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval
Automatic combination of multiple ranked retrieval systems
SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
Natural language information retrieval in digital libraries
Proceedings of the first ACM international conference on Digital libraries
Exploiting clustering and phrases for context-based information retrieval
Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval
Clustering user queries of a search engine
Proceedings of the 10th international conference on World Wide Web
Query clustering using user logs
ACM Transactions on Information Systems (TOIS)
Efficient phrase querying with an auxiliary index
SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
A layered approach to NLP-based information retrieval
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Information retrieval using robust natural language processing
ACL '92 Proceedings of the 30th annual meeting on Association for Computational Linguistics
Building a lexical domain map from text corpora
COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 1
Fast phrase querying with combined indexes
ACM Transactions on Information Systems (TOIS)
Distributional term representations: an experimental comparison
Proceedings of the thirteenth ACM international conference on Information and knowledge management
Adaptive anti-spam filtering for agglutinative languages: a special case for Turkish
Pattern Recognition Letters
Information retrieval using robust natural language processing
HLT '91 Proceedings of the workshop on Speech and Natural Language
Feature selection and feature extraction for text categorization
HLT '91 Proceedings of the workshop on Speech and Natural Language
Document representation in natural language text retrieval
HLT '94 Proceedings of the workshop on Human Language Technology
The phrase-based vector space model for automatic retrieval of free-text medical documents
Data & Knowledge Engineering
TExtractor: a multilingual terminology extraction tool
HLT '02 Proceedings of the second international conference on Human Language Technology Research
Wikipedia-based semantic interpretation for natural language processing
Journal of Artificial Intelligence Research
Feature generation for text categorization using world knowledge
IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Exploiting internal and external semantics for the clustering of short texts using world knowledge
Proceedings of the 18th ACM conference on Information and knowledge management
Word or phrase?: learning which unit to stress for information retrieval
ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - Volume 2
TextGraphs-4 Proceedings of the 2009 Workshop on Graph-based Methods for Natural Language Processing
ECIR'03 Proceedings of the 25th European conference on IR research
Evaluating a temporal pattern detection method for finding research keys in bibliographical data
Transactions on rough sets XIV
Multimodal indexing based on semantic cohesion for image retrieval
Information Retrieval
Beyond the bag of words: a text representation for sentence selection
AI'06 Proceedings of the 19th international conference on Advances in Artificial Intelligence: Canadian Society for Computational Studies of Intelligence
Categorization of large text collections: feature selection for training neural networks
IDEAL'06 Proceedings of the 7th international conference on Intelligent Data Engineering and Automated Learning
Query phrase expansion using wikipedia in patent class search
AIRS'11 Proceedings of the 7th Asia conference on Information Retrieval Technology
Hi-index | 0.00 |
Term clustering and syntactic phrase formation are methods for transforming natural language text. Both have had only mixed success as strategies for improving the quality of text representations for document retrieval. Since the strengths of these methods are complementary, we have explored combining them to produce superior representations. In this paper we discuss our implementation of a syntactic phrase generator, as well as our preliminary experiments with producing phrase clusters. These experiments show small improvements in retrieval effectiveness resulting from the use of phrase clusters, but it is clear that corpora much larger than standard information retrieval test collections will be required to thoroughly evaluate the use of this technique.