Robust Speaker Identification System Based on Wavelet Transform and Gaussian Mixture Model.

التفاصيل البيبلوغرافية
العنوان: Robust Speaker Identification System Based on Wavelet Transform and Gaussian Mixture Model.
المؤلفون: Keh-Yih Su, Tsujii, Jun'ichi, Jong-Hyeok Lee, Oi Yee Kwong, Wan-Chen Chen, Ching-Tang Hsieh, Eugene Lai
المصدر: Natural Language Processing - IJCNLP 2004; 2005, p263-271, 9p
مستخلص: This paper presents an effective method for improving the performance of a speaker identification system. Based on the multiresolution property of the wavelet transform, the input speech signal is decomposed into various frequency bands in order not to spread noise distortions over the entire feature space. The linear predictive cepstral coefficients (LPCCs) of each band are calculated. Furthermore, the cepstral mean normalization technique is applied to all computed features. We use feature recombination and likelihood recombination methods to evaluate the task of the text-independent speaker identification. The feature recombination scheme combines the cepstral coefficients of each band to form a single feature vector used to train the Gaussian mixture model (GMM). The likelihood recombination scheme combines the likelihood scores of independent GMM for each band. Experimental results show that both proposed methods outperform the GMM model using full-band LPCCs and mel-frequency cepstral coefficients (MFCCs) in both clean and noisy environments. [ABSTRACT FROM AUTHOR]
Copyright of Natural Language Processing - IJCNLP 2004 is the property of Springer eBooks and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
قاعدة البيانات: Supplemental Index