Information Processing and Management: an International Journal
Progress in the application of natural language processing to information retrieval tasks
The Computer Journal - Special issue on information retrieval
TREC-2 Proceedings of the second conference on Text retrieval conference
Natural language information retrieval
TREC-2 Proceedings of the second conference on Text retrieval conference
Natural language processing for information retrieval
Communications of the ACM
Theory of Syntactic Recognition for Natural Languages
Theory of Syntactic Recognition for Natural Languages
Introduction to Modern Information Retrieval
Introduction to Modern Information Retrieval
Lexical semantic techniques for corpus analysis
Computational Linguistics - Special issue on using large corpora: II
Corpus statistics meet the noun compound: some empirical results
ACL '95 Proceedings of the 33rd annual meeting on Association for Computational Linguistics
Summarizing Similarities and Differences Among Related Documents
Information Retrieval
Automatic identification and organization of index terms for interactive browsing
Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries
Unit Completion for a Computer-aided Translation Typing System
Machine Translation
A Corpus-Based Learning Method of Compound Noun Indexing Rules for Korean
Information Retrieval
Concept Based Adaptive IR Model Using FCA-BAM Combination for Concept Representation and Encoding
Proceedings of the 24th BCS-IRSG European Colloquium on IR Research: Advances in Information Retrieval
AICS '02 Proceedings of the 13th Irish International Conference on Artificial Intelligence and Cognitive Science
Unit completion for a computer-aided translation typing system
ANLC '00 Proceedings of the sixth conference on Applied natural language processing
Evaluation of automatically identified index terms for browsing electronic documents
ANLC '00 Proceedings of the sixth conference on Applied natural language processing
Fast statistical parsing of noun phrases for document indexing
ANLC '97 Proceedings of the fifth conference on Applied natural language processing
An automated system that assists in the generation of document indexes
Natural Language Engineering
A layered approach to NLP-based information retrieval
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
ACM Transactions on Asian Language Information Processing (TALIP)
Corpus-based learning of compound noun indexing
RANLPIR '00 Proceedings of the ACL-2000 workshop on Recent advances in natural language processing and information retrieval: held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics - Volume 11
Disambiguating noun compounds with latent semantic indexing
COMPUTERM '02 COLING-02 on COMPUTERM 2002: second international workshop on computational terminology - Volume 14
A risk minimization framework for information retrieval
Information Processing and Management: an International Journal - Special issue: Formal methods for information retrieval
The phrase-based vector space model for automatic retrieval of free-text medical documents
Data & Knowledge Engineering
TExtractor: a multilingual terminology extraction tool
HLT '02 Proceedings of the second international conference on Human Language Technology Research
Recognition and classification of noun phrases in queries for effective retrieval
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Unsupervised query segmentation using generative language models and wikipedia
Proceedings of the 17th international conference on World Wide Web
Relating dependent indexes using dempster-shafer theory
Proceedings of the 17th ACM conference on Information and knowledge management
Statistical Language Models for Information Retrieval A Critical Review
Foundations and Trends in Information Retrieval
A Hybrid Approach to Improve Bilingual Multiword Expression Extraction
PAKDD '09 Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
Integrating phrase inseparability in phrase-based model
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Determining the syntactic structure of medical terms in clinical notes
BioNLP '07 Proceedings of the Workshop on BioNLP 2007: Biological, Translational, and Clinical Language Processing
Improving product review search experiences on general search engines
Proceedings of the 11th International Conference on Electronic Commerce
CorefApp '99 Proceedings of the Workshop on Coreference and its Applications
Terminology Extraction from Log Files
DEXA '09 Proceedings of the 20th International Conference on Database and Expert Systems Applications
Representing Context Information for Document Retrieval
FQAS '09 Proceedings of the 8th International Conference on Flexible Query Answering Systems
A risk minimization framework for information retrieval
Information Processing and Management: an International Journal - Special issue: Formal methods for information retrieval
A bootstrapping approach for Chinese main verb identification
KES'07/WIRN'07 Proceedings of the 11th international conference, KES 2007 and XVII Italian workshop on neural networks conference on Knowledge-based intelligent information and engineering systems: Part I
A hybrid framework to extract bilingual multiword expression from free text
Expert Systems with Applications: An International Journal
Concept based representations for ranking in geographic information retrieval
IceTAL'10 Proceedings of the 7th international conference on Advances in natural language processing
Towards the web of concepts: extracting concepts from large datasets
Proceedings of the VLDB Endowment
Pruning terminology extracted from a specialized corpus for CV ontology acquisition
OTM'06 Proceedings of the 2006 international conference on On the Move to Meaningful Internet Systems: AWeSOMe, CAMS, COMINF, IS, KSinBIT, MIOS-CIAO, MONET - Volume Part II
A set of NP-Extraction rules for portuguese: defining, learning and pruning
PROPOR'06 Proceedings of the 7th international conference on Computational Processing of the Portuguese Language
Identifying well-formed biomedical phrases in MEDLINE® text
Journal of Biomedical Informatics
Hi-index | 0.00 |
Information retrieval is an important application area of natural-language processing where one encounters the genuine challenge of processing large quantities of unrestricted natural-language text. This paper reports on the application of a few simple, yet robust and efficient noun-phrase analysis techniques to create better indexing phrases for information retrieval. In particular, we describe an hybrid approach to the extraction of meaningful (continuous or discontinuous) subcompounds from complex noun phrases using both corpus statistics and linguistic heuristics. Results of experiments show that indexing based on such extracted subcompound improves both recall and precision in an information retrieval system. The noun-phrase analysis techniques are also potentially useful for book indexing and automatic thesaurus extraction.