Implementation and evaluation of statistical parametric speech synthesis methods for the Persian language
Bahaadini, S ; Sharif University of Technology | 2011
- Type of Document: Article
- DOI: 10.1109/MLSP.2011.6064608
- Publisher: 2011
- Abstract:
- Research on Persian speech synthesis over the last ten years has been scattered and limited. No comprehensive framework that properly implements and adapts statistical speech synthesis methods for Persian has yet been developed. In this paper, recent statistical parametric speech synthesis methods, including CLUSTERGEN, traditional HMM-based speech synthesis, and its STRAIGHT version, are implemented and adapted for the Persian language. A comparison category rating (CCR) test is carried out to compare these methods with each other and with the unit selection method. Listeners score samples on the comparison mean opinion score (CMOS) scale. The methods are ranked by averaging the CCR scores. The results show that the STRAIGHT-based system produces the best quality. The traditional HMM-based and unit selection systems rank second and third, with approximately the same quality. Finally, CLUSTERGEN produces the worst quality among the four systems.
- Keywords:
- Persian language ; speech synthesis ; statistical parametric ; text to speech ; CCR test ; HMM-based speech synthesis ; Persian speech ; Persians ; Quality ranking ; Synthesis method ; Unit selection ; Learning systems ; Signal processing
- Source: IEEE International Workshop on Machine Learning for Signal Processing, 18 September 2011 through 21 September 2011 ; September 2011 ; Page(s): 1-6 ; 9781457716232 (ISBN)
- URL: http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=6064608
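The abstract states that the methods were ranked by averaging the CCR scores. The following is a minimal sketch of one way such a ranking could be computed, not the authors' procedure. It assumes each rating is a signed comparison score for an ordered pair of systems (positive means the first system was preferred) and that each score is credited to the first system and debited from the second. The function name `rank_by_mean_ccr`, the system labels, and the scores are all hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def rank_by_mean_ccr(ratings):
    """Rank systems by their mean comparison score.

    ratings: iterable of (system_a, system_b, score) tuples, where score
    is the listener's rating of system_a relative to system_b
    (assumed convention: positive means system_a sounded better).
    Returns a list of (system, mean_score) sorted from best to worst.
    """
    totals = defaultdict(float)
    counts = defaultdict(int)
    for system_a, system_b, score in ratings:
        # Credit the score to the first system of the pair ...
        totals[system_a] += score
        counts[system_a] += 1
        # ... and debit the same amount from the second system.
        totals[system_b] -= score
        counts[system_b] += 1
    means = {name: totals[name] / counts[name] for name in totals}
    return sorted(means.items(), key=lambda item: item[1], reverse=True)


# Hypothetical ratings, invented purely for illustration.
example_ratings = [
    ("sys_a", "sys_b", 2),
    ("sys_a", "sys_c", 1),
    ("sys_b", "sys_c", -1),
]
ranking = rank_by_mean_ccr(example_ratings)
```

With these invented ratings, `ranking` lists `sys_a` first, then `sys_c`, then `sys_b`. A real evaluation would also need to balance presentation order and report confidence intervals, which this sketch omits.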