KP-Miner: A keyphrase extraction system for English and Arabic documents

  • Authors: Samhaa R. El-Beltagy; Ahmed Rafea

  • Affiliations: Faculty of Computers and Information, Computer Science Department, Cairo University, 5 Dr. Ahmed Zewail Street, 12613 Orman, Giza, Egypt; Computer Science Department, American University in Cairo, 113 Kasr El Aini St., PO Box 2511, Cairo 11511, Egypt

  • Venue: Information Systems
  • Year: 2009

Abstract

Automatic keyphrase extraction has many important applications, including summarization, cataloging and indexing, feature extraction for clustering and classification, and data mining. This paper presents the KP-Miner system and demonstrates, through experiments and comparison with widely used systems, that it is effective and efficient at extracting keyphrases from both English and Arabic documents of varied length. Unlike other existing keyphrase extraction systems, KP-Miner does not need to be trained on a particular document set to perform its task. It is also configurable, because the rules and heuristics it adopts relate to the general nature of documents and keyphrases. Users can therefore draw on their understanding of the input documents to fine-tune the system to their particular needs.