Linguistic information extraction for job ads (SIRE project)

  • Authors:
  • Romain Loth;Delphine Battistelli;François-Régis Chaumartin;Hugues de Mazancourt;Jean-Luc Minel;Axelle Vinckx

  • Affiliations:
  • MoDyCo - UMR CNRS, Univ Paris Ouest Nanterre La Défense, Nanterre;MoDyCo - UMR CNRS, Univ Paris Ouest Nanterre La Défense, Nanterre;Proxem, Clichy;Lingway Labs, Le Kremlin-Bicêtre;MoDyCo - UMR CNRS, Univ Paris Ouest Nanterre La Défense, Nanterre;Lingway Labs, Le Kremlin-Bicêtre

  • Venue:
  • RIAO '10 Adaptivity, Personalization and Fusion of Heterogeneous Information
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

As a text, each job advertisement expresses rich information about the occupation at hand, such as competence needs (i.e. required degrees, field knowledge, task expertise or technical skills). To facilitate the access to this information, the SIRE project conducted a corpus based study of how to articulate HR expert ontologies with modern semi-supervised information extraction techniques. An adaptive semantic labeling framework is developed through a parallel work on retrieval rules and on latent semantic lexicons of terms and jargon phrases. In its operational stage, our prototype will collect online job ads and index their content into detailed RDF triples compatible with applications ranging from enhanced job search to automated labor-market analysis.