Loading...

Noise and speaker robustness in a persian continuous speech recognition system

Veisi, H ; Sharif University of Technology | 2007

158 Viewed
  1. Type of Document: Article
  2. DOI: 10.1109/ISSPA.2007.4555292
  3. Publisher: 2007
  4. Abstract:
  5. In this paper VTLN speaker normalization, MLLR and MAP adaptation methods are investigated in a Persian HMM-based speaker independent large vocabulary continuous speech recognition system. Speaker and environmental noise robustness are achieved in real world applications for this system. A search-based method is used in VTLN to find speaker relative warping factors. The warping factors are applied to signal's spectrum to normalize the variation effect of VTL between speakers. In the MLLR framework, Gaussian mean and covariance transformations in global and full adaptation are experienced. In this method, regression tree based adaptation in batch-supervised fashion is used. Also the standard MAP is experienced as an adaptation method. Combinations of these approaches with CMN robust feature method are evaluated on 4 different tasks. Significant improvement is achieved in the recognition performance in noisy environments such that it makes the system operational in real applications. ©2007 IEEE
  6. Keywords:
  7. Computer networks ; Continuous speech recognition ; Signal processing ; Speech ; Speech analysis ; Weaving ; International symposium ; Persian ; Warping factors ; Speech recognition
  8. Source: 2007 9th International Symposium on Signal Processing and its Applications, ISSPA 2007, Sharjah, 12 February 2007 through 15 February 2007 ; 2007 ; 1424407796 (ISBN); 9781424407798 (ISBN)
  9. URL: https://ieeexplore.ieee.org/document/4555292