Talking Head Detection by Likelihood-Ratio Test

September 12, 2014

Conference Paper

Author:

Carl B. Quillen

…

Kara B. Greenfield
William M. Campbell

Published in:

Second Workshop on Speech, Language, Audio in Multimedia

R&D Area:

Cyber Security and Information Sciences

R&D Group:

Artificial Intelligence Technology and Systems

Talking Head Detection by Likelihood-Ratio Test(220.2 KB)

Summary

Detecting accurately when a person whose face is visible in an audio-visual medium is the audible speaker is an enabling technology with a number of useful applications. The likelihood-ratio test formulation and feature signal processing employed here allow the use of high-dimensional feature sets in the audio and visual domain, and the approach appears to have good detection performance for AV segments as short as a few seconds.