Publications
Analysis of nonmodal phonation using minimum entropy deconvolution
Summary
Nonmodal phonation occurs when glottal pulses exhibit nonuniform pulse-to-pulse characteristics such as irregular spacings, amplitudes, and/or shapes. The analysis of regions of such nonmodality has application to automatic speech, speaker, language, and dialect recognition. In this paper, we examine the usefulness of a technique called minimum-entropy deconvolution, or MED, for...
Lincoln Laboratory high-speed solid-state imager technology
Summary
Massachusetts Institute of Technology, Lincoln Laboratory (MIT LL) has been developing both continuous and burst solid-state focal-plane-array technology for a variety of high-speed imaging applications. For continuous imaging, a 128 × 128-pixel charge-coupled device (CCD) has been fabricated with multiple output ports for operating rates greater than 10,000 frames...
Reducing speech coding distortion for speaker identification
Summary
In this paper, we investigate the degradation of speaker identification performance due to speech coding algorithms used in digital telephone networks, cellular telephony, and voice over IP. By analyzing the difference between front-end feature vectors derived from coded and uncoded speech in terms of spectral distortion, we are able to...
Pitch-scale modification using the modulated aspiration noise source
Summary
Spectral harmonic/noise component analysis of spoken vowels shows evidence of noise modulations with peaks in the estimated noise source component synchronous with both the open phase of the periodic source and with time instants of glottal closure. Inspired by this observation of natural modulations and of fullband energy in the...
Missing feature theory with soft spectral subtraction for speaker verification
Summary
This paper considers the problem of training/testing mismatch in the context of speaker verification and, in particular, explores the application of missing feature theory in the case of additive white Gaussian noise corruption in testing. Missing feature theory allows for corrupted features to be removed from scoring, the initial step...
An overview of automatic speaker diarization systems
Summary
Audio diarization is the process of annotating an input audio channel with information that attributes (possibly overlapping) temporal regions of signal energy to their specific sources. These sources can include particular speakers, music, background noise sources, and other signal source/channel characteristics. Diarization can be used for helping speech recognition, facilitating...
Coherent beam combining of large number of PM fibres in 2-D fibre array
Summary
Coherent combining of a record 48 PM fibres in a phased array configuration is reported. The resulting Strehl ratio degrades by
Using filter banks to improve interceptor performance against weaving targets
Summary
It is well known that interceptor performance against a weaving or spiraling target can be improved by use of a special-purpose weave guidance law. However, the weave guidance law requires knowledge of the target weave frequency. When the target weave frequency is unknown, an extended Kalman filter is usually...
An end-to-end demonstration of a receiver array based free-space photon counting communications link
Summary
NASA anticipates a significant demand for long-haul communications service from deep-space to Earth in the near future. To address this need, a substantial effort has been invested in developing a free-space laser communications system that can be operated at data rates that are 10-1000 times higher than current RF systems...
Toward an interagency language roundtable based assessment of speech-to-speech translation capabilities
Summary
We present observations from three exercises designed to map the effective listening and speaking skills of an operator of a speech-to-speech translation system (S2S) to the Interagency Language Roundtable (ILR) scale. Such a mapping is nontrivial, but will be useful for government and military decision makers in managing expectations of...