Syllable duration prediction for Farsi text-to-speech systems

Nazari, B ; Sharif University of Technology | 2004

  1. Type of Document: Article
  2. Publisher: Sharif University of Technology , 2004
  3. Abstract:
  4. In this paper, two different statistical approaches are used for syllable duration prediction in the Farsi language. These two statistical models are Neural Networks (NN) and Classification And Regression Trees (CART). The first step in this work was to create a database and develop a flexible feature extraction and selection module. In the next step, the output of the feature selection module was used to train both models. The results of the trained models are further studied to determine the most important parameters affecting the syllable duration in Farsi. The model accuracy is evaluated by using separate training and test data. In the third step of this work, an automatic rule generator module was added to the CART model. These duration prediction rules can be easily applied in a rule-based speech synthesis system. © Sharif University of Technology
  5. Keywords:
  6. Computer program ; Database systems ; Statistical methods ; Speech synthesis ; Neural networks ; Mathematical models ; Formal languages
  7. Source: Scientia Iranica ; Volume 11, Issue 3 , 2004 , Pages 225-233 ; 10263098 (ISSN)
  8. URL: http://scientiairanica.sharif.edu/article_2552.html
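
The abstract's third step, generating duration rules from a CART model, can be illustrated with a minimal sketch. This is not the authors' implementation: the feature names (`n_phonemes`, `stressed`, `phrase_final`), the toy durations, and all function names are hypothetical and chosen only for illustration. The sketch fits a small regression tree by minimizing the sum of squared errors and then turns each root-to-leaf path into an IF/THEN rule. Evaluation on separate test data, which the paper performs, is omitted for brevity.

```python
from statistics import mean

# Hypothetical feature names; the paper's actual feature set is not given here.
FEATURES = ["n_phonemes", "stressed", "phrase_final"]

# Synthetic toy data (NOT from the paper): feature tuple -> duration in ms.
TRAIN = [
    ((2, 0, 0), 120.0), ((2, 1, 0), 150.0),
    ((3, 0, 0), 170.0), ((3, 1, 0), 200.0),
    ((2, 0, 1), 180.0), ((2, 1, 1), 210.0),
    ((3, 0, 1), 230.0), ((3, 1, 1), 260.0),
]


def sse(ys):
    """Sum of squared errors around the mean (0.0 for an empty list)."""
    if not ys:
        return 0.0
    m = mean(ys)
    return sum((y - m) ** 2 for y in ys)


def build_tree(rows, depth=0, max_depth=2, min_leaf=1):
    """Grow a regression tree by greedy SSE-minimizing binary splits."""
    ys = [y for _, y in rows]
    node = {"value": mean(ys), "n": len(rows)}
    if depth >= max_depth or len(rows) < 2 * min_leaf:
        return node
    best = None
    for f in range(len(rows[0][0])):
        values = sorted({x[f] for x, _ in rows})
        # Candidate thresholds are midpoints between adjacent observed values.
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2
            left = [r for r in rows if r[0][f] <= t]
            right = [r for r in rows if r[0][f] > t]
            if len(left) < min_leaf or len(right) < min_leaf:
                continue
            cost = sse([y for _, y in left]) + sse([y for _, y in right])
            if best is None or cost < best[0]:
                best = (cost, f, t, left, right)
    # Stop if no valid split exists or the best split does not reduce the error.
    if best is None or best[0] >= sse(ys):
        return node
    _, f, t, left, right = best
    node["feature"] = f
    node["threshold"] = t
    node["left"] = build_tree(left, depth + 1, max_depth, min_leaf)
    node["right"] = build_tree(right, depth + 1, max_depth, min_leaf)
    return node


def predict(node, x):
    """Follow the splits down to a leaf and return its mean duration."""
    while "feature" in node:
        go_left = x[node["feature"]] <= node["threshold"]
        node = node["left"] if go_left else node["right"]
    return node["value"]


def extract_rules(node, conditions=()):
    """Turn every root-to-leaf path into one IF/THEN duration rule."""
    if "feature" not in node:
        cond = " AND ".join(conditions) or "TRUE"
        return [f"IF {cond} THEN duration = {node['value']:.1f} ms"]
    name, t = FEATURES[node["feature"]], node["threshold"]
    return (extract_rules(node["left"], conditions + (f"{name} <= {t}",))
            + extract_rules(node["right"], conditions + (f"{name} > {t}",)))


if __name__ == "__main__":
    tree = build_tree(TRAIN)
    for rule in extract_rules(tree):
        print(rule)
```

Each printed rule is a plain condition on the input features, which is the form a rule-based synthesizer could consume directly. The depth limit (`max_depth`) controls how many rules are produced; it is an assumption of this sketch, not a setting reported in the abstract.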