Loading...

Two multimodal approaches for single microphone source separation

Sedighin, F ; Sharif University of Technology | 2016

614 Viewed
  1. Type of Document: Article
  2. DOI: 10.1109/EUSIPCO.2016.7760220
  3. Publisher: European Signal Processing Conference, EUSIPCO , 2016
  4. Abstract:
  5. In this paper, the problem of single microphone source separation via Nonnegative Matrix Factorization (NMF) by exploiting video information is addressed. Respective audio and video modalities coming from a single human speech usually have similar time changes. It means that changes in one of them usually corresponds to changes in the other one. So it is expected that activation coefficient matrices of their NMF decomposition are similar. Based on this similarity, in this paper the activation coefficient matrix of the video modality is used as an initialization for audio source separation via NMF. In addition, the mentioned similarity is used for post-processing and for clustering the rows of the activation coefficient matrix which were resulted from randomly initialized NMF. Simulation results confirm the effectiveness of the proposed multimodal approaches in single microphone source separation
  6. Keywords:
  7. Single microphone source separation ; Chemical activation ; Factorization ; Matrix algebra ; Microphones ; Separation ; Signal processing ; Audio and video ; Audio source separation ; Coefficient matrix ; Multi-modal approach ; Multimodal source separation ; Nonnegative matrix factorization ; Post processing ; Video information ; Source separation
  8. Source: European Signal Processing Conference, 28 August 2016 through 2 September 2016 ; Volume 2016-November , 2016 , Pages 110-114 ; 22195491 (ISSN ; 9780992862657 (ISBN)
  9. URL: http://ieeexplore.ieee.org/document/7760220