Split Time Warping of Speech for Robust Automatic Dialogue Replacement This publication appears in: Ingeniería Electrónica Automática y Comunicaciones Authors: P. Soens and W. Verhelst Volume: 28 Pages: 71-75 Publication Year: 2007
Abstract: In soundtrack production for film and video, new dialogue is often recorded in a studio and used to replace the original dialogue recorded during filming. This dialogue replacement introduces mismatches between the words an audience perceives and the lip movements in the picture. To resolve this problem, synchronization systems have been developed that allow for automatically replacing the original location recordings with re-recorded studio dialogues. However, these systems lack robustness and often deliver time-scaled dialogue that is either insufficiently synchronized with the reference dialogue, of poor quality, or both. In this paper, is proposed an improvement to the robustness of automatic time synchronization of speech, which consists of splitting up the procedure in two steps: a first step determines the timing relationship between the recordings and a second step calculates the desired time scaling.
|