Loading...

Comparative study of different excitation signals on Mel-generalized cepstral synthesis filters

Bahaadini, S ; Sharif University of Technology | 2011

838 Viewed
  1. Type of Document: Article
  2. DOI: 10.1109/AISP.2011.5960983
  3. Publisher: 2011
  4. Abstract:
  5. In speech production systems, the vocal tract is modeled by a filter and glottal pulse by an excitation signal. In most traditional systems impulse train or noise is used as the excitation. In this paper the effects of different excitation signals on Mel-generalized cepstral filters (LPC, warped LPC, mel cepstral and ML cepstral) are studied. Excitation signals with different pulse shapes are used. Furthermore, based on voicing power, noise factor is added to the excitation signals. Totally 600 different experiments with different filter types, number of coefficients, pulse shapes, pulse widths and noise power are preformed. Synthesized speech is evaluated by PESQ measure. A number of pulse shapes result in much better quality than the impulse. It is also shown that a simple noise addition method improves speech quality in many cases. The best excitation for each filter depends on its type
  6. Keywords:
  7. Mel-generalized cepstral filter ; Cepstral ; Comparative studies ; Excitation signals ; Glottal pulse ; Noise addition ; Noise factor ; Noise power ; PESQ ; Pulse shapes ; Pulse width ; Speech production ; Speech quality ; Synthesis filters ; Synthesized speech ; Traditional systems ; Vocal-tracts ; Voice quality ; Signal processing ; Artificial intelligence
  8. Source: 2011 International Symposium on Artificial Intelligence and Signal Processing, AISP 2011, 15 June 2011 through 16 June 2011 ; June , 2011 , Pages 15-19 ; 9781424498345 (ISBN)
  9. URL: http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=5960983