Sharif Digital Repository / Sharif University of Technology / Search result

Birth-death frequencies variance of sinusoidal model a new feature for audio classification

, Article SIGMAP 2008 - International Conference on Signal Processing and Multimedia Applications, Porto, 26 July 2008 through 29 July 2008 ; 2008 , Pages 139-144 ; 9789898111609 (ISBN) Ghaemmaghami, S ; Shirazi, J ; Sharif University of Technology

2008

Abstract

In this paper, a new feature set for audio classification is presented and evaluated based on sinusoidal modeling of audio signals. Variance of the birth-death frequencies in sinusoidal model of signal, as a measure of harmony, is used and compared to typical features as the input into an audio classifier. The performance of this sinusoidal model feature is evaluated through classification of audio to speech and music using both the GMM and the SVM classifiers. Classification results show that the proposed feature is quite successful in speech/music classification. Experimental comparisons with popular features for audio classification, such as HZCRR and LSTER, are presented and discussed....

Audio classification based on sinusoidal model: a new feature

, Article 2008 IEEE Region 10 Conference, TENCON 2008, Hyderabad, 19 November 2008 through 21 November 2008 ; 2008 ; 1424424089 (ISBN); 9781424424085 (ISBN) Shirazi, J ; Ghaemmaghami, S ; Sharif University of Technology

2008

Abstract

In this paper, a new feature set is presented and evaluated based on sinusoidal modeling of audio signals. Duration of the longest sinusoidal model frequency track, as a measure of the harmony, is used and compared to typical features as input into an audio classifier. The performance of this sinusoidal model feature is evaluated through classification of audio to speech and music using both the GMM and the SVM classifiers. Classification results show the proposed feature, which could be used for the first time in such an audio classification, is quite successful in speech/music classification. Experimental comparisons with popular features for audio classification, such as HZCRR and LSTER,...

Improvements in audio classification based on sinusoidal modeling

, Article 2008 IEEE International Conference on Multimedia and Expo, ICME 2008, Hannover, 23 June 2008 through 26 June 2008 ; 2008 , Pages 1485-1488 ; 9781424425716 (ISBN) Shirazi, J ; Ghaemmaghami, S ; Razzazi, F ; Sharif University of Technology

2008

Abstract

In this paper, a set of features is presented and evaluated based on sinusoidal modeling of audio signals. Amplitude, frequency, and phase parameters of the sinusoidal model are used and compared as input features into an audio classifier system. The performance of sinusoidal model features is evaluated for classification of audio into speech and music classes using both the Gaussian and the GMM (Gaussian Mixture Model) classifiers. Experimental results show superiority of the amplitude parameters of the sinusoidal model, which could be used for the first time for such an audio classification, as compared to the popular cepstral features. By using a set of 40 sinusoidal features, we achieved...

Improvement to speech-music discrimination using sinusoidal model based features

, Article Multimedia Tools and Applications ; Volume 50, Issue 2 , November , 2010 , Pages 415-435 ; 13807501 (ISSN) Shirazi, J ; Ghaemmaghami, S ; Sharif University of Technology

2010

Abstract

This paper addresses a model-based audio content analysis for classification of speech-music mixed audio signals into speech and music. A set of new features is presented and evaluated based on sinusoidal modeling of audio signals. The new feature set, including variance of the birth frequencies and duration of the longest frequency track in sinusoidal model, as a measure of the harmony and signal continuity, is introduced and discussed in detail. These features are used and compared to typical features as inputs to an audio classifier. Performance of these sinusoidal model features is evaluated through classification of audio into speech and music using both the GMM (Gaussian Mixture Model)...