Unsupervised discovery of relations for analysis of textual data

  • Authors:
  • A. L. Louis;A. P. Engelbrecht

  • Affiliations:
  • Council for Scientific and Industrial Research (CSIR), Meraka, Building 43, Meiring Naude Road, Pretoria 0001, South Africa and Department of Computer Science, University of Pretoria, Lynnwood Roa ...;Department of Computer Science, University of Pretoria, Lynnwood Road, Pretoria 0002, South Africa

  • Venue:
  • Digital Investigation: The International Journal of Digital Forensics & Incident Response
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper addresses the problem of analysing textual data for evidence discovery. A novel framework in which to perform evidence discovery is proposed in order to reduce the quantity of data to be analysed, aid the analysts' exploration of the data and enhance the intelligibility of the presentation of the data. The framework combines information extraction techniques with visual exploration techniques to provide a novel approach to performing evidence discovery, in the form of an evidence discovery system. By utilising unrestricted, unsupervised information extraction techniques, the investigator does not require input queries or keywords for searching, thus enabling the investigator to analyse portions of the data that may not have been identified by keyword searches. A preliminary study was performed to assess the usefulness of a text mining approach to evidence discovery from a text corpus in comparison with a traditional information retrieval approach. It was concluded that the novel approach to text analysis for evidence discovery presented in this paper is a viable and promising approach for consideration in digital forensics. The preliminary experiment showed that the results obtained from the evidence discovery system are sensible and useful.