Speaker phone mode classification using Gaussian mixture models

Eghbal Zadeh, H ; Sharif University of Technology | 2011

Type of Document: Article
Publisher: 2011
Abstract:
This study addresses the classification of phone speaker modes using Gaussian mixture models (GMM). Speech data were collected from cell phones and telephones with the speaker mode both enabled and disabled, then processed and classified into two categories. GMMs with 1 to 4 mixture components and wave file sizes of 10, 20, 40 and 80 kb were tested to find an optimal condition for classification. The GMM method attained a correct classification rate of 87.99% on test data. This classification matters for speech-enabled interactive voice response (IVR) systems [1], dialog systems and many other speech processing systems, because it could help load an optimum model and thereby increase system accuracy.
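The two-class GMM scheme described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract does not state the acoustic features, covariance structure or decision rule, so the sketch assumes synthetic 2-D feature frames, diagonal covariances fitted with plain EM, and a maximum average log-likelihood decision. All function names and the labels `speaker_on` and `speaker_off` are hypothetical.

```python
import math
import random


def component_logs(x, weights, means, variances):
    """Log of weight * diagonal Gaussian density for each mixture component."""
    out = []
    for w, mu, var in zip(weights, means, variances):
        ll = math.log(w)
        for xd, md, vd in zip(x, mu, var):
            ll += -0.5 * (math.log(2 * math.pi * vd) + (xd - md) ** 2 / vd)
        out.append(ll)
    return out


def logsumexp(values):
    """Numerically stable log(sum(exp(v)))."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))


def fit_gmm(frames, k, iters=25, seed=0):
    """Fit a k-component diagonal-covariance GMM with EM (assumed setup)."""
    rng = random.Random(seed)
    n, dim = len(frames), len(frames[0])
    means = [list(f) for f in rng.sample(frames, k)]
    gmean = [sum(f[d] for f in frames) / n for d in range(dim)]
    gvar = [sum((f[d] - gmean[d]) ** 2 for f in frames) / n + 1e-6
            for d in range(dim)]
    variances = [list(gvar) for _ in range(k)]
    weights = [1.0 / k] * k
    for _ in range(iters):
        # E-step: responsibility of each component for each frame.
        resp = []
        for f in frames:
            logs = component_logs(f, weights, means, variances)
            total = logsumexp(logs)
            resp.append([math.exp(l - total) for l in logs])
        # M-step: re-estimate weights, means and variances.
        for j in range(k):
            nj = sum(r[j] for r in resp) + 1e-10
            weights[j] = nj / n
            for d in range(dim):
                mu = sum(r[j] * f[d] for r, f in zip(resp, frames)) / nj
                var = sum(r[j] * (f[d] - mu) ** 2
                          for r, f in zip(resp, frames)) / nj
                means[j][d] = mu
                variances[j][d] = max(var, 1e-6)  # variance floor
    return weights, means, variances


def avg_loglik(frames, model):
    """Average per-frame log-likelihood of an utterance under one GMM."""
    return sum(logsumexp(component_logs(f, *model)) for f in frames) / len(frames)


def classify(frames, models):
    """Pick the class whose GMM gives the highest average log-likelihood."""
    return max(models, key=lambda label: avg_loglik(frames, models[label]))


def synthetic_frames(rng, center, n):
    """Hypothetical stand-in for real acoustic feature frames."""
    return [[rng.gauss(c, 1.0) for c in center] for _ in range(n)]


rng = random.Random(0)
train = {
    "speaker_on": synthetic_frames(rng, (0.0, 0.0), 200),
    "speaker_off": synthetic_frames(rng, (3.0, 3.0), 200),
}
# One GMM per mode; k could be swept over 1..4 as in the abstract.
models = {label: fit_gmm(frames, k=2) for label, frames in train.items()}
test_utterance = synthetic_frames(rng, (3.0, 3.0), 50)
print(classify(test_utterance, models))
```

Training one model per mode and comparing likelihoods keeps the two classes independent, so the number of mixture components `k` could be varied from 1 to 4 per class without changing the decision rule.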
Keywords:
Speech enabled IVR ; Telephony device classification ; Cell phone ; Classification rates ; Dialog systems ; Gaussian Mixture Model ; Mode classification ; Optimal conditions ; Optimum model ; Speech data ; System accuracy ; Telephony speech recognition ; Test data ; Algorithms ; Signal processing ; Speech processing ; Speech recognition ; Telephone sets
Source: SPA 2011 - Signal Processing: Algorithms, Architectures, Arrangements, and Applications - Conference Proceedings, 29 September 2011 through 30 September 2011 ; September 2011, Pages 112-117 ; 9781457714863 (ISBN)
URL: http://ieeexplore.ieee.org/xpl/login.jsp?tp=&arnumber=6190945&url=http%3A%2F%2Fieeexplore.ieee.org%2Fxpls%2Fabs_all.jsp%3Farnumber%3D6190945