Development of the LEMS speech recognizer: Improving performance using feature-sets

Research output: Contribution to conferencePaperpeer-review

Abstract

This paper discusses the work performed at the Laboratory of Engineering Man-Machine Systems (LEMS) to build a state-of-the-art speech recognizer. In a period of about four years the performance of the speech recognizer has been improved from 85% to 92%, representing a 53% reduction in error rate. These performance gains were obtained by improving the feature-set algorithms and the HMM models. The main change in the feature-set was switching from the once popular LPC-based methods to a novel DFT-based method. The parameterized DFT-based method improved performance and confirmed was has been generally accepted that mel-scale warping is superior for machine speech recognition. Performance gains were also achieved by using the semi-continuous HMM model instead of the fast discrete HMM system. This change appears to offer a fixed 2-3% recognition rate improvement.

Original languageEnglish
Pages157-160
Number of pages4
Publication statusPublished - 1997
Externally publishedYes
EventProceedings of the 1997 South African Symposium on Communications and Signal Processing, COMSIG'97 - Garahamstown, S Afr
Duration: 9 Sept 199710 Sept 1997

Conference

ConferenceProceedings of the 1997 South African Symposium on Communications and Signal Processing, COMSIG'97
CityGarahamstown, S Afr
Period9/09/9710/09/97

ASJC Scopus subject areas

  • Signal Processing

Fingerprint

Dive into the research topics of 'Development of the LEMS speech recognizer: Improving performance using feature-sets'. Together they form a unique fingerprint.

Cite this