Image-to-MIDI mapping based on dynamic fuzzy color segmentation for visually impaired people

  • Authors:
  • Chieh-Li Chen;Yan-Fa Liao;Chung-Li Tai

  • Affiliations:
  • National Cheng Kung University, 1 University Road, Tainan, Taiwan;National Cheng Kung University, 1 University Road, Tainan, Taiwan;National Cheng Kung University, 1 University Road, Tainan, Taiwan

  • Venue:
  • Pattern Recognition Letters
  • Year:
  • 2011

Quantified Score

Hi-index 0.10

Visualization

Abstract

In this paper, the RGB ratio is defined according to a reference color so that an image can be transformed from a conventional color space to the RGB ratio space. Different to traditional distance measurement, a road color model is determined by an ellipse area in the RGB ratio space enclosed by the estimated boundaries. The proposed dynamic fuzzy logic, where fuzzy membership functions are defined according to estimated boundaries, is introduced to implement clustering rules, such that each pixel will have its own fuzzy membership function corresponding to its intensity. A basic neural network is trained and used to achieve parameter optimization. Experimental results for road detection demonstrate the robustness of the proposed approach to variations in intensity. To provide obstacle information, especially for visually impaired people, Musical Instrument Digital Interface (MIDI) is introduced as the sound generator, and image-to-MIDI mapping algorithm is proposed. Experimental results show that the proposed method can adapt to various road types, and the resulting audio information successfully indicates the position and size of obstacles.