Lexical and Syntactic knowledge for Information Retrieval

Authors:
Antonio Ferrández
Affiliations:
Dept. Languages and Information Systems, Carretera San Vicente S/N, University of Alicante, 03080 Alicante, Spain
Venue:
Information Processing and Management: an International Journal
Year:
2011

Abstract

Traditional Information Retrieval (IR) models assume that the index terms of queries and documents are statistically independent of each other, which is intuitively wrong. This paper proposes incorporating the lexical and syntactic knowledge generated by a POS tagger and a syntactic chunker into traditional IR similarity measures in order to capture this dependency information between terms. Our proposal is based on theories of discourse structure: we segment documents and queries into sentences and entities, and therefore measure dependencies between entities instead of between terms. Moreover, we handle discourse references for each entity. The approach has been evaluated on Spanish and English corpora, as well as on Question Answering tasks, obtaining significant improvements.
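
The contrast between term independence and entity-level dependency can be illustrated with a small sketch. It is not the similarity measure defined in the paper: the scoring functions, the `bonus` weight, and the hand-built entity sets are all hypothetical, and the chunking that a POS tagger and syntactic chunker would perform is assumed to have been done already. Discourse references are not modelled.

```python
from typing import List, Set

Entity = Set[str]         # terms grouped into one chunk, e.g. a noun phrase (assumed pre-chunked)
Sentence = List[Entity]   # a sentence represented as a list of entities


def independent_score(query: List[Entity], doc: List[Sentence]) -> float:
    """Term-independence baseline: count distinct query terms found anywhere in the document."""
    query_terms = set().union(*query)
    doc_terms = {term for sentence in doc for entity in sentence for term in entity}
    return float(len(query_terms & doc_terms))


def entity_score(query: List[Entity], doc: List[Sentence], bonus: float = 0.5) -> float:
    """Hypothetical entity-aware score: baseline plus a bonus for each extra
    term of a query entity that co-occurs inside a single document entity."""
    doc_entities = [entity for sentence in doc for entity in sentence]
    score = independent_score(query, doc)
    for query_entity in query:
        # Largest number of this query entity's terms found together in one document entity.
        best_overlap = max((len(query_entity & e) for e in doc_entities), default=0)
        score += bonus * max(0, best_overlap - 1)
    return score


# Illustrative data only: one query entity and two toy documents.
query = [{"information", "retrieval"}]
doc_same_entity = [[{"information", "retrieval"}, {"system"}]]
doc_scattered = [[{"information"}, {"desk"}], [{"retrieval"}, {"dog"}]]

for name, doc in [("same entity", doc_same_entity), ("scattered", doc_scattered)]:
    print(name, independent_score(query, doc), entity_score(query, doc))
```

Under the independence assumption both toy documents receive the same score, because each contains both query terms. The entity-aware variant ranks the first document higher, since the two terms appear together within one entity rather than in unrelated sentences.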