Rules for ontology population from text of Malaysia medicinal herbs domain

  • Authors:
  • Zaharudin Ibrahim;Shahrul Azman Noah;Mahanem Mat Noor

  • Affiliations:
  • Dept. of Inf. System Management, Faculty of Inf. Management, Universiti Teknologi MARA, Shah Alam, Selangor, Malaysia and Dept. of Information Science, Faculty of Inf. Science and Techn., Universi ...;Department of Information Science, Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, Selangor, Malaysia;School of Biosciences and Biotechnology, Faculty Science and Technology, Universiti Kebangsaan Malaysia, Selangor, Malaysia

  • Venue:
  • RSKT'10 Proceedings of the 5th international conference on Rough set and knowledge technology
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

The primary goal of ontology development is to share and reuse domain knowledge among people or machines. This study focuses on the approach of extracting semantic relationships from unstructured textual documents related to medicinal herb from websites and proposes a lexical pattern technique to acquire semantic relationships such as synonym, hyponym, and part-of relationships. The results show of nine object properties (or relations) and 105 lexico-syntactic patterns have been identified manually, including one from the Hearst hyponym rules. The lexical patterns have linked 7252 terms that have the potential as ontological terms. Based on this study, it is believed that determining the lexical pattern at an early stage is helpful in selecting relevant term from a wide collection of terms in the corpus. However, the relations and lexico-syntactic patterns or rules have to be verified by domain expert before employing the rules to the wider collection in an attempt to find more possible rules.