Improving Legal Document Summarization Using Graphical Models

  • Authors:
  • M. Saravanan;B. Ravindran;S. Raman

  • Affiliations:
  • Department of Computer Science and Engineering, IIT Madras, Chennai-600 036, Tamil Nadu, India;Department of Computer Science and Engineering, IIT Madras, Chennai-600 036, Tamil Nadu, India;Department of Computer Science and Engineering, IIT Madras, Chennai-600 036, Tamil Nadu, India

  • Venue:
  • Proceedings of the 2006 conference on Legal Knowledge and Information Systems: JURIX 2006: The Nineteenth Annual Conference
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we propose a novel idea for applying probabilistic graphical models for automatic text summarization task related to a legal domain. Identification of rhetorical roles present in the sentences of a legal document is the important text mining process involved in this task. A Conditional Random Field (CRF) is applied to segment a given legal document into seven labeled components and each label represents the appropriate rhetorical roles. Feature sets with varying characteristics are employed in order to provide significant improvements in CRFs performance. Our system is then enriched by the application of a term distribution model with structured domain knowledge to extract key sentences related to rhetorical categories. The final structured summary has been observed to be closest to 80% accuracy level to the ideal summary generated by experts in the area.