IEEE WASPAA’17 paper on Joint Diarization and Separation



Dionyssos Kounades-Bastian, Laurent Girin, Xavier Alameda-Pineda, Radu Horaud and Sharon Gannot


We got a paper accepted at IEEE WASPAA’17:Exploting the Intermittency of Speech for Joint Separation and Diarization [1].

Abstract: Natural conversations are spontaneous exchanges involving two or more people speaking in an intermittent manner. Therefore one expects such conversation to have intervals where some of the speakers are silent. Yet, most (multichannel) audio source separation (MASS) methods consider the sound sources to be continuously emitting on the total duration of the processed mixture. In this paper we propose a probabilistic model for MASS where the sources may have pauses. The activity of the sources is modeled as a hidden state, the diarization state, enabling us to activate/de-activate the sound sources at time frame resolution. We plug the diarization model within the spatial covariance matrix model proposed for MASS in [1], and obtain an improvement in performance over the state of the art when separating mixtures with intermittent speakers.

References:

  1. D. Kounades-Bastian, L. Girin, X. Alameda-Pineda, R. Horaud, and S. Gannot, “Exploting the Intermittency of Speech for Joint Separation and Diarization,” in IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, USA, 2017. [ bib pdf ]
    @inproceedings{Kounades-WASPAA-2017,
    author = {Dionyssos Kounades-Bastian and Laurent Girin and Xavier Alameda-Pineda and Radu Horaud and Sharon Gannot},
    title = {Exploting the Intermittency of Speech for Joint Separation and Diarization},
    booktitle = {IEEE Workshop on Applications of Signal Processing to Audio and Acoustics},
    year = {2017},
    address = {New Paltz, USA},
      pdf={http://xavirema.eu/wp-content/papercite-data/pdf/Kounades-WASPAA-2017.pdf}
    }

Category: Research

No responses yet.

Your email address will not be published. Required fields are marked *

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <s> <strike> <strong>