Abstract
This paper is concerned with the search for an optimal feature-set for a speech recognition system. A better acoustic feature analysis that suitably enhances the semantic information in a consistent fashion can reduce raw-score (no grammar) error rate significantly. A simple two-dimensional parameterized feature-set is proposed. The feature-set is compared against a standard mel-cepstrum, LPC-based feature-set in talker-independent, connected-alphadigit HMM-based recognizer. The results show that a particular combination of parameters yields a significantly lower error rate than the baseline mel-cepstrum LPC-based feature-set.
Original language | English |
---|---|
Pages (from-to) | 17-20 |
Number of pages | 4 |
Journal | Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing |
Volume | 1 |
Publication status | Published - 1996 |
Externally published | Yes |
Event | Proceedings of the 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP. Part 1 (of 6) - Atlanta, GA, USA Duration: 7 May 1996 → 10 May 1996 |
ASJC Scopus subject areas
- Software
- Signal Processing
- Electrical and Electronic Engineering