Publications
Short-time signal representation by nonlinear difference equations
Summary
The solution of a nonlinear difference equation can take on complicated deterministic behavior which appears to be random for certain values of the equation's coefficients. Due to the sensitivities to initial conditions of the output of such "chaotic" systems, it is difficult to duplicate the waveform structure by parameter analysis...
Noise reduction using a soft-decision sine-wave vector quantizer
Summary
The need for noise reduction arises in speech communication channels, such as ground-to-air transmission and ground-based cellular radio, to improve vocoder quality and speech recognition accuracy. In this paper, noise reduction is performed in the context of a high-quality harmonic zero-phase sine-wave analysis/synthesis system which is characterized by sine-wave amplitudes...
Automatic talker activity labeling for co-channel talker interference suppression
Summary
This paper describes a speaker activity detector taking co-channel speech as input and labeling intervals of the input as target-only, jammer-only, or two-speaker (target+jammer). The algorithms applied were borrowed primarily from speaker recognition, thereby allowing us to use speaker-dependent test-utterance-independent information in a front-end for co-channel talker interference suppression. Parameters...
Robust speech recognition using hidden Markov models: overview of a research program
Summary
This report presents an overview of a program of speech recognition research which was initiated in 1985 with the major goal of developing techniques for robust high performance speech recognition under the stress and noise conditions typical of a military aircraft cockpit. The work on recognition in stress and noise...
An approach to co-channel talker interference suppression using a sinusoidal model for speech
Summary
This paper describes a new approach to co-channel talker interference suppression based on a sinusoidal representation of speech. The technique fits a sinusoidal model to additive vocalic speech segments such that the least mean-squared error between the model and the summed waveforms is obtained. Enhancement is achieved by synthesizing a waveform...
Spoken language systems
Summary
Spoken language is the most natural and common form of human-human communication, whether face to face, over the telephone, or through various communication media such as radio and television. In contrast, human-machine interaction is currently achieved largely through keyboard strokes, pointing, or other mechanical means, using highly stylized languages. Communication...
Far-echo cancellation in the presence of frequency offset (full duplex modem)
Summary
In this paper, we present a design for a full-duplex echo-cancelling data modem based on a combined adaptive reference algorithm and adaptive channel equalizer. The adaptive reference algorithm has the advantage that interference to the echo canceller caused by the far-end signal can be eliminated by subtracting an estimate of...
Phase coherence in speech reconstruction for enhancement and coding applications
Summary
It has been shown that an analysis-synthesis system based on a sinusoidal representation leads to synthetic speech that is essentially perceptually indistinguishable from the original. A change in speech quality has been observed, however, when the phase relation of the sine waves is altered. This occurs in practice when sine...
Speech-state-adaptive simulation of co-channel talker interference suppression
Summary
A co-channel talker interference suppression system processes an input waveform containing the sum of two simultaneous speech signals, referred to as the target and the jammer, to produce a waveform estimate of the target speech signal alone. This paper describes the evaluation of a simulated suppression system performing ideal suppression...
Review of neural networks for speech recognition
Summary
The performance of current speech recognition systems is far below that of humans. Neural nets offer the potential of providing massive parallelism, adaptation, and new algorithmic approaches to problems in speech recognition. Initial studies have demonstrated that multi-layer networks with time delays can provide excellent discrimination between small sets of...