Improving Speech Signal Models for Statistical Parametric Speech Synthesis

Khorram, Soheil; Sameti, Hossein

Improving Speech Signal Models for Statistical Parametric Speech Synthesis

, Ph.D. Dissertation Sharif University of Technology Khorram, Soheil (Author) ; Sameti, Hossein (Supervisor)

Abstract

Statistical parametric speech synthesis (SPSS) has dominated speech synthesis research area over the last decade, due to its remarkable advantages such as high intelligibility and flexibility. Decision tree-clustered context-dependent hidden semi-Markov models are typically used in SPSS to represent probability densities of acoustic features given contextual factors. This research addresses four major limitations of this decision tree-based structure: (a) The decision tree structure lacks adequate context generalization; (b) It is unable to express complex context dependencies; (c) Parameters generated from this structure represent sudden transitions between adjacent states; (e) This...

Improving Speech Signal Models for Statistical Parametric Speech Synthesis

Cataloging brief

Improving Speech Signal Models for Statistical Parametric Speech Synthesis

Find in content

Bookmark