A hybrid SVM/MCE training approach for vector space topic identification of spoken audio recordings

September 22, 2008

Conference Paper

Author:

Timothy J. Hazen

…

Frederick S. Richardson

Published in:

INTERSPEECH 2008, 22-26 September 2008, pp. 2542-2545.

R&D Area:

Cyber Security and Information Sciences

R&D Group:

Artificial Intelligence Technology and Systems

A hybrid SVM/MCE training approach for vector space topic identification of spoken audio recordings

Summary

The success of support vector machines (SVMs) for classification problems is often dependent on an appropriate normalization of the input feature space. This is particularly true in topic identification, where the relative contribution of the common but uninformative function words can overpower the contribution of the rare but informative content words in the SVM kernel function score if the feature space is not normalized properly. In this paper we apply the discriminative minimum classification error (MCE) training approach to the problem of learning an appropriate feature space normalization for use with an SVM classifier. Results are presented showing significant error rate reductions for an SVM-based system on a topic identification task using the Fisher corpus of audio recordings of human conversations.