Loading...
Light-sernet: a lightweight fully convolutional neural network for speech emotion recognition
Aftab, A ; Sharif University of Technology | 2022
45
Viewed
- Type of Document: Article
- DOI: 10.1109/ICASSP43922.2022.9746679
- Publisher: Institute of Electrical and Electronics Engineers Inc , 2022
- Abstract:
- Detecting emotions directly from a speech signal plays an important role in effective human-computer interactions. Existing speech emotion recognition models require massive computational and storage resources, making them hard to implement concurrently with other machine-interactive tasks in embedded systems. In this paper, we propose an efficient and lightweight fully convolutional neural network for speech emotion recognition in systems with limited hardware resources. In the proposed FCNN model, various feature maps are extracted via three parallel paths with different filter sizes. This helps deep convolution blocks to extract high-level features, while ensuring sufficient separability. The extracted features are used to classify the emotion of the input speech segment. While our model has a smaller size than that of the state-of-the-art models, it achieves a higher performance on the IEMOCAP and EMO-DB datasets. The source code is available https://github.com/AryaAftab/LIGHT-SERNET. © 2022 IEEE
- Keywords:
- Mel frequency Cepstrum coefficient (MFCC) ; Convolution ; Convolutional neural networks ; Emotion Recognition ; Human computer interaction ; Speech recognition ; Computational resources ; Convolutional neural network ; Embedded-system ; Lightweight model ; Mel frequency cepstrum coefficient ; Mel frequency cepstrum coefficients ; Recognition models ; Speech emotion recognition ; Speech signals ; Storage resources ; Embedded systems
- Source: 47th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2022, 23 May 2022 through 27 May 2022 ; Volume 2022-May , 2022 , Pages 6912-6916 ; 15206149 (ISSN); 9781665405409 (ISBN)
- URL: https://ieeexplore.ieee.org/document/9746679
