Grammatical inference with bioinformatics criteria

  • Authors:
  • Vivian F. López; Ramiro Aguilar; Luis Alonso; María N. Moreno; Juan M. Corchado

  • Affiliations:
  • Departamento Informática y Automática, University of Salamanca, Plaza de la Merced S/N, 37008 Salamanca, Spain (all authors)

  • Venue:
  • Neurocomputing
  • Year:
  • 2012

Abstract

This paper presents theoretical and practical results for a novel hybrid approach that combines association analysis with classical sequencing algorithms from genomics to generate the grammatical structures of a specific language. A compiler generator system is used to build a practical application in the area of grammarware, where concepts of language analysis are applied to other disciplines such as bioinformatics. The tool automatically measures the complexity of the grammar obtained from textual data. A technique based on the incremental discovery of sequential patterns yields simplified production rules, which are then compacted using bioinformatics criteria to form a grammar.
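
The idea of turning recurring sequential patterns into production rules can be illustrated with a small sketch. This is not the authors' algorithm, which the abstract does not specify; it is a simplified pair-substitution scheme from grammar-based compression, used here only as a stand-in. The function names `infer_rules` and `expand`, the `min_count` threshold, and the `R1`, `R2`, ... rule names are all hypothetical, and the input tokens are assumed not to collide with those rule names.

```python
from collections import Counter

def infer_rules(tokens, min_count=2):
    """Illustrative only: repeatedly replace the most frequent adjacent
    pair of symbols with a fresh nonterminal (hypothetical names R1, R2, ...)."""
    seq = list(tokens)
    rules = {}
    while True:
        # Count every adjacent pair in the current sequence.
        pairs = Counter(zip(seq, seq[1:]))
        if not pairs:
            break
        pair, count = pairs.most_common(1)[0]
        if count < min_count:
            break
        name = f"R{len(rules) + 1}"
        rules[name] = pair
        # Rewrite the sequence left to right, substituting the new nonterminal.
        out, i = [], 0
        while i < len(seq):
            if i + 1 < len(seq) and (seq[i], seq[i + 1]) == pair:
                out.append(name)
                i += 2
            else:
                out.append(seq[i])
                i += 1
        seq = out
    return seq, rules

def expand(symbol, rules):
    """Expand a symbol back into the terminal tokens it derives."""
    if symbol not in rules:
        return [symbol]
    left, right = rules[symbol]
    return expand(left, rules) + expand(right, rules)

tokens = "the cat sat the cat ran".split()
start, rules = infer_rules(tokens)
restored = [t for s in start for t in expand(s, rules)]
```

Here `start` is the compacted top-level sequence and `rules` maps each nonterminal to its right-hand side; expanding `start` recovers the original tokens, so the grammar derives exactly the input text.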