Abstract
This paper discusses the work performed at the Laboratory of Engineering Man-Machine Systems (LEMS) to build a state-of-the-art speech recognizer. In a period of about four years the performance of the speech recognizer has been improved from 85% to 92%, representing a 53% reduction in error rate. These performance gains were obtained by improving the feature-set algorithms and the HMM models. The main change in the feature-set was switching from the once popular LPC-based methods to a novel DFT-based method. The parameterized DFT-based method improved performance and confirmed was has been generally accepted that mel-scale warping is superior for machine speech recognition. Performance gains were also achieved by using the semi-continuous HMM model instead of the fast discrete HMM system. This change appears to offer a fixed 2-3% recognition rate improvement.
Original language | English |
---|---|
Pages | 157-160 |
Number of pages | 4 |
Publication status | Published - 1997 |
Externally published | Yes |
Event | Proceedings of the 1997 South African Symposium on Communications and Signal Processing, COMSIG'97 - Garahamstown, S Afr Duration: 9 Sept 1997 → 10 Sept 1997 |
Conference
Conference | Proceedings of the 1997 South African Symposium on Communications and Signal Processing, COMSIG'97 |
---|---|
City | Garahamstown, S Afr |
Period | 9/09/97 → 10/09/97 |
ASJC Scopus subject areas
- Signal Processing