A comparative study of various speech recognition techniques.
Pitchers, Richard Charles.
MetadataShow full item record
Speech recognition systems fall into four categories, depending on whether they are speaker-dependent or independent of speaker population and on whether they are capable of recognizing continuous speech or only isolated words. A study was made of most methods used in speech recognition to date. Four speech recognition techniques for speaker-dependent isolated word applications were then implemented in software on an IBM PC with a minimum of interfacing hardware. These techniques made use of short-time energy and zero-crossing rates, autocorrelation coefficients, linear predictor coefficients and cepstral coefficients. A comparison of their relative performances was made using four test vocabularies that were 10, 30, 60 and 120 words in size. These consisted of 10 digits, 30 and 60 computer terms and lastly 120 airline reservation terms. The performance of any speech recognition system is affected by a number of parameters. The effects of frame length, pre-emphasis, window functions, dynamic time warping and the filter order were also studied experimentally.