Improvements in audio classification based on sinusoidal modeling

Please enable javascript in your browser.

Shirazi, J ; Sharif University of Technology | 2008

530 Viewed

Type of Document: Article
DOI: 10.1109/ICME.2008.4607727
Publisher: 2008
Abstract:
In this paper, a set of features is presented and evaluated based on sinusoidal modeling of audio signals. Amplitude, frequency, and phase parameters of the sinusoidal model are used and compared as input features into an audio classifier system. The performance of sinusoidal model features is evaluated for classification of audio into speech and music classes using both the Gaussian and the GMM (Gaussian Mixture Model) classifiers. Experimental results show superiority of the amplitude parameters of the sinusoidal model, which could be used for the first time for such an audio classification, as compared to the popular cepstral features. By using a set of 40 sinusoidal features, we achieved 95.06% accuracy in the audio classification at frame level, as compared to 92.26% accuracy obtained with the MFCC coefficients, as tested over the same audio corpus. © 2008 IEEE
Keywords:
Classifiers ; Exhibitions ; Learning systems ; Trellis codes ; Amplitude parameters ; Audio classifications ; Audio signals ; Cepstral features ; Classifier systems ; Gaussian ; Gaussian mixture models ; Input features ; Modeling ; Phase parameters ; Sinusoidal models ; Audio acoustics
Source: 2008 IEEE International Conference on Multimedia and Expo, ICME 2008, Hannover, 23 June 2008 through 26 June 2008 ; 2008 , Pages 1485-1488 ; 9781424425716 (ISBN)
URL: https://ieeexplore.ieee.org/document/4607727