Data mining for grammatical inference with bioinformatics criteria

  • Authors:
  • Vivian F. López;Ramiro Aguilar;Luis Alonso;María N. Moreno;Juan M. Corchado

  • Affiliations:
  • Departamento Informática y Automática, University of Salamanca, Salamanca;Departamento Informática y Automática, University of Salamanca, Salamanca;Departamento Informática y Automática, University of Salamanca, Salamanca;Departamento Informática y Automática, University of Salamanca, Salamanca;Departamento Informática y Automática, University of Salamanca, Salamanca

  • Venue:
  • HAIS'10 Proceedings of the 5th international conference on Hybrid Artificial Intelligence Systems - Volume Part II
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we describe both theoretical and practical results of a novel data mining process that combines hybrid techniques of association analysis and classical sequentiation algorithms of genomics to generate grammatical structures of a specific language We used an application of a compilers generator system that allows the development of a practical application within the area of grammarware, where the concepts of the language analysis are applied to other disciplines, such as Bioinformatic The tool allows the complexity of the obtained grammar to be measured automatically from textual data A technique of incremental discovery of sequential patterns is presented to obtain simplified production rules, and compacted with bioinformatics criteria to make up a grammar.