Publications
The SuperSID project : exploiting high-level information for high-accuracy speaker recognition
Summary
Summary
The area of automatic speaker recognition has been dominated by systems using only short-term, low-level acoustic information, such as cepstral features. While these systems have indeed produced very low error rates, they ignore other levels of information beyond low-level acoustics that convey speaker information. Recently published work has shown examples...
Using prosodic and conversational features for high-performance speaker recognition : report from JHU WS'02
Summary
Summary
While there has been a long tradition of research seeking to use prosodic features, especially pitch, in speaker recognition systems, results have generally been disappointing when such features are used in isolation and only modest improvements have been set when used in conjunction with traditional cepstral GMM systems. In contrast...
Evaluation of TDWR range-velocity ambiguity mitigation techniques
Summary
Summary
Range and velocity ambiguities pose significant data quality challenges for the Terminal Doppler Weather Radar (TDWR). For typical pulse repetition frequencies (PRFs) of 1-2 kHz, the radar is subject to both range-ambiguous precipitation returns and velocity aliasing. Experience shows that these are a major contributor to failures of the system's...
A multi-threaded fast convolver for dynamically parallel image filtering
Summary
Summary
2D convolution is a staple of digital image processing. The advent of large format imagers makes it possible to literally ''pave'' with silicon the focal plane of an optical sensor, which results in very large images that can require a significant amount computation to process. Filtering of large images via...
Observations of non-traditional wind shear events at the Dallas/Fort Worth International Airport
Summary
Summary
During the past 20 years there has been great success in understanding and detecting microbursts. These "traditional" wind shear events are most prominent in the summer and are characterized by a two-dimensional, divergent outflow associated with precipitation loading from a thunderstorm downdraft or evaporative cooling from high-based rain clouds. Analysis...
Multi-radar integration to improve en route aviation operations in severe convective weather
Summary
Summary
In this paper, we describe a major new FAA initiative, the Corridor Integrated Weather System (CIWS), to improve convective weather decision support for congested en route airspace and the terminals within that airspace through use of a large, heterogeneous network of weather sensing radars as well as many additional sensors...
Automated forecasting of road conditions and recommended road treatments for winter storms
Summary
Summary
Over the past decade there have been significant improvements in the availability, volume, and quality of the sensors and technology utilized to both capture the current state of the atmosphere and generate weather forecasts. New radar systems, automated surface observing systems, satellites and advanced numerical models have all contributed to...
Marathon evaluation of optical materials for 157-nm lithography
Summary
Summary
We present the methodology and recent results on the longterm evaluation of optical materials for 157-nm lithographic applications. We review the unique metrology capabilities that have been developed for accurately assessing optical properties of samples both online and offline, utilizing VUV spectrophotometry with in situlamp-based cleaning. We describe ultraclean marathon...
Phonetic speaker recognition with support vector machines
Summary
Summary
A recent area of significant progress in speaker recognition is the use of high level features-idiolect, phonetic relations, prosody, discourse structure, etc. A speaker not only has a distinctive acoustic sound but uses language in a characteristic manner. Large corpora of speech data available in recent years allow experimentation with...
Modeling prosodic dynamics for speaker recognition
Summary
Summary
Most current state-of-the-art automatic speaker recognition systems extract speaker-dependent features by looking at short-term spectral information. This approach ignores long-term information that can convey supra-segmental information, such as prosodics and speaking style. We propose two approaches that use the fundamental frequency and energy trajectories to capture long-term information. The first...