Publication Abstract

Reynolds, D. A., Campbell, J. P., Campbell, W. M., Dunn, R. B., Gleason, T. P., Jones, D. A., Quatieri, T. F., Quillen, C. B., Sturim, D. E., and Torres-Carrasquillo, P. A., Beyond Cepstra: Exploiting High-Level Information in Speaker Recognition. In Proc. Workshop on Multimodal User Authentication in Santa Barbara, California, pp. 223-229, 11-12 December 2003.

Abstract

Traditionally, speaker recognition techniques have focused on using short-term, low-level acoustic information such as cepstra features extracted over 20-30 ms windows of speech. But speech is a complex behavior conveying more information about the speaker than merely the sounds that are characteristic of his vocal apparatus. This higher-level information includes speaker-specific prosodics, pronunciations, word usage, and conversational style. In this paper, we review some of the techniques to extract and apply these sources of high-level information with results from the NIST 2003 Extended Data Task.