Using argumentation to retrieve articles with similar citations from MEDLINE

Authors:
Imad Tbahriti;Christine Chichester;Frédérique Lisacek;Patrick Ruch
Affiliations:
Geneva Bioinformatics (GeneBio) SA, Geneva and University of Geneva;Geneva Bioinformatics (GeneBio) SA, Geneva;Geneva Bioinformatics (GeneBio) SA, Geneva and Swiss Institute of Bioinformatics, Geneva;University Hospital of Geneva
Venue:
JNLPBA '04 Proceedings of the International Joint Workshop on Natural Language Processing in Biomedicine and its Applications
Year:
2004

Citing 13
Cited 3

Combining mapping and citation analysis for evaluative bibliometric purposes: a bibliometric study

Journal of the American Society for Information Science
Evaluating evaluation measure stability

SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Ranking retrieval systems without relevance judgments

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Summarizing scientific articles: experiments with relevance and rhetorical status

Computational Linguistics - Summarization
Who Links to Whom: Mining Linkage between Web Sites

ICDM '01 Proceedings of the 2001 IEEE International Conference on Data Mining
Pathfinder networks and author cocitation analysis: a remapping of paradigmatic information scientists

Journal of the American Society for Information Science and Technology
Methods for ranking information retrieval systems without relevance judgments

Proceedings of the 2003 ACM symposium on Applied computing
A maximum entropy approach to identifying sentence boundaries

ANLC '97 Proceedings of the fifth conference on Applied natural language processing
Extracting the names of genes and gene products with a hidden Markov model

COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 1
A methodology for terminology-based knowledge acquisition and integration

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Using contextual spelling correction to improve retrieval effectiveness in degraded text collections

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Distribution of information in biomedical abstracts and full-text publications

Bioinformatics
Protein name tagging for biomedical annotation in text

BioMed '03 Proceedings of the ACL 2003 workshop on Natural language processing in biomedicine - Volume 13

A baseline feature set for learning rhetorical zones using full articles in the biomedical domain

ACM SIGKDD Explorations Newsletter - Natural language processing and text mining
Argumentative feedback: a linguistically-motivated term expansion for information retrieval

COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
Features combination for extracting gene functions from MEDLINE

ECIR'05 Proceedings of the 27th European conference on Advances in Information Retrieval Research

Quantified Score

Hi-index	0.01

Visualization

Abstract

The aim of this study is to investigate the relationships between citations and the scientific argumentation found in the abstract. We extracted citation lists from a set of 3200 full-text papers originating from a narrow domain. In parallel, we recovered the corresponding MEDLINE records for analysis of the argumentative moves. Our argumentative model is founded on four classes: PURPOSE, METHODS, RESULTS, and CONCLUSION. A Bayesian classifier trained on explicitly structured MEDLINE abstracts generates these argumentative categories. The categories are used to generate four different argumentative indexes. A fifth index contains the complete abstract, together with the title and the list of Medical Subject Headings (MeSH) terms. To appraise the relationship of the moves to the citations, the citation lists were used as the criteria for determining relatedness of articles, establishing a benchmark. Our results show that the average precision of queries with the PURPOSE and CONCLUSION features is the highest, while the precision of the RESULTS and METHODS features was relatively low. A linear weighting combination of the moves is proposed, which significantly improves retrieval of related articles.