A Visual Silence Detector Constraining Speech Source Separation Host Publication: Finds and Results from the Swedish Cyprus Expedition: A Gender Perspective at the Medelhavsmuseet Authors: I. Gonzalez, I. Ravyse, W. Verhelst, H. Brouckxon, D. Jiang and H. Sahli Publisher: IEEE Computer Society Press Publication Date: Sep. 2009 Number of Pages: 8 ISBN: 978-0-7695-3883-9
Abstract: -We propose an audiovisual source separation algorithm
for speech signals. In our proposed algorithm we first extract
the time segments with low activity of the mouth region
from synchronous video recordings. An automatically selected
optimal classifier is used to detect silent intervals in
these instants of low visual mouth activity. Then, the source
separation problem is formulated and solved for the entire
signal duration. Our approach was tested on two challenging
speech corpora with two speakers and two microphones,
namely in the first corpus separate source signals
were mixed in a simulated room, and the second corpus contains
recorded conversations. The results are promising on
both corpora: with the visual silence detector the performance
of the source separation algorithm, measured by the
signal to noise inference ratio increases.
|