Application of Cross-Language Criteria for the Automatic Distinction of Expert and Non Expert Online Health Documents

Authors:
Natalia Grabar;Sonia Krivine
Affiliations:
INSERM, UMR_S 729, Eq. 20, Paris, F-75006, France and Health on the Net Foundation, SIM/HUG, Geneva, Switzerland;FircoSoft, 37 rue de Lyon, 75012 Paris, France
Venue:
AIME '07 Proceedings of the 11th conference on Artificial Intelligence in Medicine
Year:
2007

Citing 1
Cited 1

Automatic Recognition of Text Difficulty from Consumers Health Information

CBMS '06 Proceedings of the 19th IEEE Symposium on Computer-Based Medical Systems

Assessing user-specific difficulty of documents

Information Processing and Management: an International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

Distinction between expert and non expert documents is an important issue in the medical area, for instance in the context of information retrieval. In our work we address this issue through stylistic corpus analysis and application of machine learning algorithms. Our hypothesis is that this distinction can be observed on the basis of a little number of criteria and that such criteria can be language and domain independent. The used criteria have been acquired in source corpus (Russian) and then tested on source and target (French) corpora. The method shows up to 90% precision and 93% recall, and 85% precision and 74% recall in source and target corpora.