Loading...

Mel-scaled Discrete Wavelet Transform and dynamic features for the Persian phoneme recognition

Tavanaei, A ; Sharif University of Technology | 2011

967 Viewed
  1. Type of Document: Article
  2. DOI: 10.1109/AISP.2011.5960989
  3. Publisher: 2011
  4. Abstract:
  5. In this paper we use a feature vector consisting of the Mel Frequency Discrete Wavelet Coefficients to recognize spoken phonemes in the Persian language. The purpose of using wavelet in feature extraction is to benefit from its multi resolution analysis and localization property in time and frequency domains. The MFDWCs are obtained by applying the Discrete Wavelet Transform (DWT) to the Mel-scaled log filter bank energies of a speech frame. Feature vectors are used for the HMM-based phoneme recognition on a portion of the FarsDat Persian language database consisting of 35 hour recorded data for training and 15 hour for testing. We evaluate the performance of new features for clean speech and noisy speech and compare it with the Mel Frequency Cepstral Coefficients (MFCC). Experiments on a phone recognition task based on the MFDWC give better result than recognizers based on the MFCC features for both white noise and clean speech cases
  6. Keywords:
  7. Wavelet transform ; Clean speech ; Dynamic features ; FARSDAT ; Feature vectors ; Localization properties ; Mel-frequency cepstral coefficients ; Mel-frequency discrete wavelet coefficients ; mel-scaled wavelet transform ; MFCC ; Multi-resolutions ; Noisy speech ; Persians ; Phone recognition ; Phoneme recognition ; Speech frames ; Task-based ; Time and frequency domains ; Artificial intelligence ; Discrete wavelet transforms ; Feature extraction ; Filter banks ; Metadata ; White noise ; Speech recognition
  8. Source: 2011 International Symposium on Artificial Intelligence and Signal Processing, AISP 2011, 15 June 2011 through 16 June 2011 ; June , 2011 , Pages 138-140 ; 9781424498345 (ISBN)
  9. URL: http://ieeexplore.ieee.org/xpl/login.jsp?tp=&arnumber=5960989&url=http%3A%2F%2Fieeexplore.ieee.org%2Fiel5%2F5955054%2F5960967%2F05960989.pdf%3Farnumber%3D5960989