Publications
An integrated speech-background model for robust speaker identification
Summary
Summary
This paper examines a procedure for text independent speaker identification in noisy environments where the interfering background signals cannot be characterized using traditional broadband or impulsive noise models. In the procedure, both the speaker and the background processes are modeled using mixtures of Gaussians. Speaker and background models are integrated...
Shape invariant time-scale and pitch modification of speech
Summary
Summary
The simplified linear model of speech production predicts that when the rate of articulation is changed, the resulting waveform takes on the appearance of the original, except for a change in the time scale. The goal of this paper is to develop a time-scale modification system that preserves this shape-invariance...
Opportunities for advanced speech processing in military computer-based systems
Summary
Summary
This paper presents a study of military applications of advanced speech processing technology which includes three major elements: 1) review and assessment of current efforts in military applications of speech technology; 2) identification of opportunities for future military applications of advanced speech technology; and 3) identification of problem areas where...
Low-rate speech coding based on the sinusoidal model
Summary
Summary
One approach to the problem of representation of speech signals is to use the speech production model in which speech is viewed as the result of passing a glottal excitation waveform through a time-varying linear filter that models the resonant characteristics of the vocal tract. In many applications it suffices...
Speech nonlinearities, modulations, and energy operators
Summary
Summary
In this paper, we investigate an AM-FM model for representing modulations in speech resonances. Specifically, we propose a frequency modulation (FM) model for the time-varying formants whose amplitude varies as the envelope of an amplitude-modulated (AM) signal. To detect the modulations we apply the energy operator (psi)(x) = (x)^2 -...
Peak-to-rms reduction of speech based on a sinusoidal model
Summary
Summary
In a number of applications, a speech waveform is processed using phase dispersion and amplitude compression to reduce its peak-to-rms ratio so as to increase loudness and intelligibility while minimizing perceived distortion. In this paper, a sinusoidal-based analysis/synthesis system is used to apply a radar design solution to the problem...
Short-time signal representation by nonlinear difference equations
Summary
Summary
The solution of a nonlinear difference equation can take on complicated deterministic behavior which appears to be random for certain values of the equation's coefficients. Due to the sensitivities to initial conditions of the output of such "chaotic" systems, it is difficult to duplicate the waveform structure by parameter analysis...
Noise reduction using a soft-decision sine-wave vector quantizer
Summary
Summary
The need for noise reduction arises in speech communication channels, such as ground-to-air transmission and ground-based cellular radio, to improve vocoder quality and speech recognition accuracy. In this paper, noise reduction is performed in the context of a high-quality harmonic serc-phase sine-wave analysis/synthesis system which is characterized by sine-wave amplitudes...
Automatic talker activity labeling for co-channel talker interference suppression
Summary
Summary
This paper describes a speaker activity detector taking co-channel speech as input and labeling intervals of the input as target-only, jammer-only, or two-speaker (target+jammer). The algorithms applied were borrowed primarily from speaker recognition, thereby allowing us to use speaker-dependent test-utterance-independent information in a front-end for co-channel talker interference suppression. Parameters...
Robust speech recognition using hidden Markov models: overview of a research program
Summary
Summary
This report presents an overview of a program of speech recognition research which was initiated in 1985 with the major goal of developing techniques for robust high performance speech recognition under the stress and noise conditions typical of a military aircraft cockpit. The work on recognition in stress and noise...