TY - GEN
T1 - Pattern recognition and feature selection for the development of a new artificial larynx
AU - Russell, Megan J.
AU - Rubin, David M.
AU - Marwala, Tshilidzi
AU - Wigdorowitz, Brian
PY - 2009
Y1 - 2009
N2 - A palatometer system is used to read in tongue-palate contact patterns made during speech. The purpose is ultimately to develop an artificial larynx which will operate by determining the intended speech, and then synthesising the voice in a way that will hopefully mimic the user's prelaryngectomy sound. This paper describes the pattern recognition and feature selection techniques used to extract information from the tongue-palate contact patterns, and the effects the various methods have on the classification results. Training and testing datasets were constructed using 50 common words. The following feature extraction methods were used on the datasets: Principal Component Analysis, Fourier Descriptors, Correlation, Image Properties and Generic Fourier Descriptors. Once the features were extracted they were then used as input to a Multi-Layer Perceptron (MLP) Neural Network. The best MLP-based classification rate for the testing dataset was 78%, and this was achieved with the input of Correlation Coefficients. Further research will be conducted to try to improve these classification rates using a voting scheme, and possibly the application of word context.
AB - A palatometer system is used to read in tongue-palate contact patterns made during speech. The purpose is ultimately to develop an artificial larynx which will operate by determining the intended speech, and then synthesising the voice in a way that will hopefully mimic the user's prelaryngectomy sound. This paper describes the pattern recognition and feature selection techniques used to extract information from the tongue-palate contact patterns, and the effects the various methods have on the classification results. Training and testing datasets were constructed using 50 common words. The following feature extraction methods were used on the datasets: Principal Component Analysis, Fourier Descriptors, Correlation, Image Properties and Generic Fourier Descriptors. Once the features were extracted they were then used as input to a Multi-Layer Perceptron (MLP) Neural Network. The best MLP-based classification rate for the testing dataset was 78%, and this was achieved with the input of Correlation Coefficients. Further research will be conducted to try to improve these classification rates using a voting scheme, and possibly the application of word context.
UR - http://www.scopus.com/inward/record.url?scp=77950119587&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-03882-2_196
DO - 10.1007/978-3-642-03882-2_196
M3 - Conference contribution
AN - SCOPUS:77950119587
SN - 9783642038815
T3 - IFMBE Proceedings
SP - 736
EP - 739
BT - World Congress on Medical Physics and Biomedical Engineering
PB - Springer Verlag
T2 - World Congress on Medical Physics and Biomedical Engineering: Image Processing, Biosignal Processing, Modelling and Simulation, Biomechanics
Y2 - 7 September 2009 through 12 September 2009
ER -