The AMTEx approach in the medical document indexing and retrieval application

Authors:
Angelos Hliaoutakis;Kaliope Zervanou;Euripides G. M. Petrakis
Affiliations:
Department of Electronic and Computer Engineering, Technical University of Crete (TUC), Chania, Greece;Department of Electronic and Computer Engineering, Technical University of Crete (TUC), Chania, Greece;Department of Electronic and Computer Engineering, Technical University of Crete (TUC), Chania, Greece
Venue:
Data & Knowledge Engineering
Year:
2009

Abstract

AMTEx is a medical document indexing method designed for the automatic indexing of documents in large medical collections such as MEDLINE, the premier bibliographic database of the US National Library of Medicine (NLM). AMTEx combines MeSH, the terminological thesaurus of NLM, with a well-established term extraction method, the C/NC-value method. The performance of two AMTEx configurations is evaluated against the current state of the art, the MetaMap Transfer (MMTx) method, in four experiments using two types of corpora, a subset of the MEDLINE (PMC) full-document corpus and a subset of MEDLINE (OHSUMED) abstracts, for the indexing and retrieval tasks. The experimental results show that AMTEx achieves better indexing performance than MMTx while requiring only 20-50% of its processing time, and that in the retrieval task AMTEx performs better on the full-text (PMC) corpus.
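
To make the term-extraction step more concrete, the following is a minimal sketch of the C-value part of the C/NC-value method, followed by a lookup against a thesaurus. It is an illustration under stated assumptions, not the AMTEx implementation: the function names, the toy frequencies, and the small `mesh_terms` set standing in for MeSH are all hypothetical, and the NC-value context weighting is omitted.

```python
import math


def is_nested(short, long):
    """True if the word sequence `short` occurs inside the longer sequence `long`."""
    n, m = len(short), len(long)
    return n < m and any(long[i:i + n] == short for i in range(m - n + 1))


def c_value(term_freqs):
    """Score multi-word candidate terms with the C-value measure.

    term_freqs maps a candidate term (tuple of words) to its corpus frequency.
    A candidate that only occurs on its own scores log2(length) * frequency.
    A candidate nested in longer candidates has the mean frequency of those
    longer candidates subtracted from its own frequency first.
    Single-word candidates score 0 because log2(1) == 0.
    """
    scores = {}
    for term, freq in term_freqs.items():
        containers = [t for t in term_freqs if is_nested(term, t)]
        weight = math.log2(len(term))
        if containers:
            nested = sum(term_freqs[t] for t in containers) / len(containers)
            scores[term] = weight * (freq - nested)
        else:
            scores[term] = weight * freq
    return scores


# Hypothetical candidate frequencies, for illustration only.
freqs = {
    ("soft", "contact", "lens"): 5,
    ("contact", "lens"): 8,
}
scores = c_value(freqs)

# Hypothetical stand-in for a thesaurus lookup: keep only candidates
# that appear in a toy set of accepted terms.
mesh_terms = {("contact", "lens")}
index_terms = {t: s for t, s in scores.items() if t in mesh_terms}
```

In this sketch the shorter candidate is penalised for appearing inside the longer one, which is what lets the measure prefer complete terms over their fragments before any thesaurus lookup takes place.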