LattesMiner: a multilingual DSL for information extraction from lattes platform

  • Authors:
  • Alexandre D. Alves;Horacio H. Yanasse;Nei Y. Soma

  • Affiliations:
  • Instituto Nacional de Pesquisas Espaciais, São José dos Campos, Brazil;Instituto Nacional de Pesquisas Espaciais, São José dos Campos, Brazil;Instituto Tecnológico da Aeronáutica, São José dos Campos, Brazil

  • Venue:
  • Proceedings of the compilation of the co-located workshops on DSM'11, TMC'11, AGERE!'11, AOOPES'11, NEAT'11, & VMIL'11
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

The Lattes CV system, a curricular information system maintained by CNPq, is the core of the Lattes Platform. This system is undoubtedly the major source of information on Brazilian researchers. This paper describes "LattesMiner", a multilingual domain-specific language for automatic information extraction from Lattes curricula. It is composed by a set of classes written in Java that allows developers to implement their own applications with a high-level abstraction and expression power. LattesMiner can extract data belonging to the Lattes Platform from any individual researcher or group of researchers by its name or given (ID) number. The data extracted can be analyzed and used, for instance, to identify academic social networks, regional competences, profile of groups in diferent areas of research etc. We illustrate its use with a case study.