Application of Cross-Language Criteria for the Automatic Distinction of Expert and Non Expert Online Health Documents

  • Authors:
  • Natalia Grabar;Sonia Krivine

  • Affiliations:
  • INSERM, UMR_S 729, Eq. 20, Paris, F-75006, France and Health on the Net Foundation, SIM/HUG, Geneva, Switzerland;FircoSoft, 37 rue de Lyon, 75012 Paris, France

  • Venue:
  • AIME '07 Proceedings of the 11th conference on Artificial Intelligence in Medicine
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Distinction between expert and non expert documents is an important issue in the medical area, for instance in the context of information retrieval. In our work we address this issue through stylistic corpus analysis and application of machine learning algorithms. Our hypothesis is that this distinction can be observed on the basis of a little number of criteria and that such criteria can be language and domain independent. The used criteria have been acquired in source corpus (Russian) and then tested on source and target (French) corpora. The method shows up to 90% precision and 93% recall, and 85% precision and 74% recall in source and target corpora.