Stop classification using DESA-1 high-resolution formant tracking

  • Authors:
  • J. T. Foote;D. J. Mashao;H. F. Silverman

  • Affiliations:
  • Laboratory for Engineering Man-Machine Systems, Brown University, Providence, RI;Laboratory for Engineering Man-Machine Systems, Brown University, Providence, RI;Laboratory for Engineering Man-Machine Systems, Brown University, Providence, RI

  • Venue:
  • ICASSP'93 Proceedings of the 1993 IEEE international conference on Acoustics, speech, and signal processing: speech processing - Volume II
  • Year:
  • 1993

Quantified Score

Hi-index 0.00

Visualization

Abstract

Recent work has verified that the second-formant frequency (F2) and its change in the vowel immediately preceding a stop consonant are usually sufficient to discriminate between labial, palatal, and alveolar stops, even in the absence of the stop burst information. Informal listening tests using truncated samples indicate that humans can discriminate the three stops on the basis of the preceding vowel alone. Typical quasi-stationary analyses like LPC and DFT filterbanks may not have sufficient time-frequency resolution to detect the rapid F2 variations, and therefore a valuable source of stop classification information is being overlooked. This paper shows the results of using the DESA-1 quadratic frequency estimator to determine the frequency and rate of change of F2. It is shown for different vowel environments that the DESA-1 algorithm can extract sufficient information to classify stops from vocalic data. The performance is demonstrated to be superior to a formant tracker using a more conventional pitch-synchronous LPC analysis.