Speech enhancement using sparse convolutive non-negative matrix factorization with basis adaptation

September 9, 2012

Conference Paper

Author:

Michael A. Carlin

…

Published in:

INTERSPEECH 2012: 13th Annual Conf. of the Int. Speech Communication Assoc., 9-13 September 2012.

R&D Area:

Cyber Security and Information Sciences

R&D Group:

Speech enhancement using sparse convolutive non-negative matrix factorization with basis adaptation

Summary

We introduce a framework for speech enhancement based on convolutive non-negative matrix factorization that leverages available speech data to enhance arbitrary noisy utterances with no a priori knowledge of the speakers or noise types present. Previous approaches have shown the utility of a sparse reconstruction of the speech-only components of an observed noisy utterance. We demonstrate that an underlying speech representation which, in addition to applying sparsity, also adapts to the noisy acoustics improves overall enhancement quality. The proposed system performs comparably to a traditional Wiener filtering approach, and the results suggest that the proposed framework is most useful in moderate- to low-SNR scenarios.