Modified Segmental Histogram Equalization for robust speaker verification

Marshalleno Skosan, Daniel Mashao

Research output: Contribution to journalArticlepeer-review

15 Citations (Scopus)

Abstract

It is well known that when there is an acoustic mismatch between the speech obtained during training and testing the accuracy of speaker recognition systems drastically deteriorates. In this paper we propose Modified Segmental Histogram Equalization to improve the robustness of a speaker verification system operating in telephone environments. The technique transforms the features extracted from short adjacent segments of speech within an utterance such that their statistics conform to that of a Gaussian distribution with zero mean and unity variance across all recording conditions. In doing so, the feature statistics become less environment-dependent. Experiments performed on the NIST 2000 database show significant improvements in performance.

Original languageEnglish
Pages (from-to)479-486
Number of pages8
JournalPattern Recognition Letters
Volume27
Issue number5
DOIs
Publication statusPublished - 1 Apr 2006
Externally publishedYes

Keywords

  • Histogram Equalization
  • Mismatched conditions
  • NIST 2000
  • Speaker verification

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Computer Vision and Pattern Recognition
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Modified Segmental Histogram Equalization for robust speaker verification'. Together they form a unique fingerprint.

Cite this