Supervised machine learning for summarizing legal documents

  • Authors:
  • Mehdi Yousfi-Monod;Atefeh Farzindar;Guy Lapalme

  • Affiliations:
  • Laboratoire RALI RALI-DIRO Université de Montréal, Université de Montréal, Montréal, Québec, Canada;NLP Technologies Inc., Montréal, Québec, Canada;Laboratoire RALI RALI-DIRO Université de Montréal, Université de Montréal, Montréal, Québec, Canada

  • Venue:
  • AI'10 Proceedings of the 23rd Canadian conference on Advances in Artificial Intelligence
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents a supervised machine learning approach for summarizing legal documents A commercial system for the analysis and summarization of legal documents provided us with a corpus of almost 4,000 text and extract pairs for our machine learning experiments That corpus was pre-processed to identify the selected source sentences in extracts from which we generated legal structured data We finally describe our sentence classification experiments relying on a Naive Bayes classifier using a set of surface, emphasis, and content features.