Publications

Refine Results

(Filters Applied) Clear All

Eigen-channel compensation and discriminatively trained Gaussian mixture models for dialect and accent recognition

Published in:
Proc. INTERSPEECH 2008, 22-26 September 2008, pp. 723-726.

Summary

This paper presents a series of dialect/accent identification results for three sets of dialects with discriminatively trained Gaussian mixture models and feature compensation using eigen-channel decomposition. The classification tasks evaluated in the paper include: 1)the Chinese language classes, 2) American and Indian accented English and 3) discrimination between three Arabic dialects. The first two tasks were evaluated on the 2007 NIST LRE corpus. The Arabic discrimination task was evaluated using data derived from the LDC Arabic set collected by Appen. Analysis is performed for the English accent problem studied and an approach to open set dialect scoring is introduced. The system resulted in equal error rates at or below 10% for each of the tasks studied.
READ LESS

Summary

This paper presents a series of dialect/accent identification results for three sets of dialects with discriminatively trained Gaussian mixture models and feature compensation using eigen-channel decomposition. The classification tasks evaluated in the paper include: 1)the Chinese language classes, 2) American and Indian accented English and 3) discrimination between three Arabic...

READ MORE

The MITLL NIST LRE 2007 language recognition system

Summary

This paper presents a description of the MIT Lincoln Laboratory language recognition system submitted to the NIST 2007 Language Recognition Evaluation. This system consists of a fusion of four core recognizers, two based on tokenization and two based on spectral similarity. Results for NIST?s 14-language detection task are presented for both the closed-set and open-set tasks and for the 30, 10 and 3 second durations. On the 30 second 14-language closed set detection task, the system achieves a 1% equal error rate.
READ LESS

Summary

This paper presents a description of the MIT Lincoln Laboratory language recognition system submitted to the NIST 2007 Language Recognition Evaluation. This system consists of a fusion of four core recognizers, two based on tokenization and two based on spectral similarity. Results for NIST?s 14-language detection task are presented for...

READ MORE

Two protocols comparing human and machine phonetic discrimination performance in conversational speech

Published in:
INTERSPEECH 2008, 22-26 September 2008, pp. 1630-1633.

Summary

This paper describes two experimental protocols for direct comparison on human and machine phonetic discrimination performance in continuous speech. These protocols attempt to isolate phonetic discrimination while controlling for language and segmentation biases. Results of two human experiments are described including comparisons with automatic phonetic recognition baselines. Our experiments suggest that in conversational telephone speech, human performance on these tasks exceeds that of machines by 15%. Furthermore, in a related controlled language model control experiment, human subjects were better able to correctly predict words in conversational speech by 45%.
READ LESS

Summary

This paper describes two experimental protocols for direct comparison on human and machine phonetic discrimination performance in continuous speech. These protocols attempt to isolate phonetic discrimination while controlling for language and segmentation biases. Results of two human experiments are described including comparisons with automatic phonetic recognition baselines. Our experiments suggest...

READ MORE

Beyond frame independence: parametric modelling of time duration in speaker and language recognition

Published in:
INTERSPEECH 2008, 22-26 September 2008, pp. 767-770.

Summary

In this work, we address the question of generating accurate likelihood estimates from multi-frame observations in speaker and language recognition. Using a simple theoretical model, we extend the basic assumption of independent frames to include two refinements: a local correlation model across neighboring frames, and a global uncertainty due to train/test channel mismatch. We present an algorithm for discriminative training of the resulting duration model based on logistic regression combined with a bisection search. We show that using this model we can achieve state-of-the-art performance for the NIST LRE07 task. Finally, we show that these more accurate class likelihood estimates can be combined to solve multiple problems using Bayes' rule, so that we can expand our single parametric backend to replace all six separate back-ends used in our NIST LRE submission for both closed and open sets.
READ LESS

Summary

In this work, we address the question of generating accurate likelihood estimates from multi-frame observations in speaker and language recognition. Using a simple theoretical model, we extend the basic assumption of independent frames to include two refinements: a local correlation model across neighboring frames, and a global uncertainty due to...

READ MORE

Detection probability modeling for airport wind-shear sensors

Author:
Published in:
MIT Lincoln Laboratory Report ATC-340

Summary

An objective wind-shear detection probability estimation model is developed for radar, lidar, and sensor combinations. The model includes effects of system sensitivity, site-specific wind-shear, clutter, and terrain blockage characteristics, range-aliased obscuration statistics, antenna beam filling and attenuation, and signal processing differences which allow a sensor- and site-specific performance analysis of deployed and future systems. A total of 161 sites are analyzed for the study, consisting of airports currently serviced by the Terminal Doppler Weather Radar (TDWR) (46), Airport Surveillance Radar Weather Systems Processor (ASR-9 WSP) (35), Low Altitude Wind Shear Alert System-Relocation/Sustainment (LLWAS-RS) (40), and no wind-shear detection system (40). Sensors considered are the TDWR, WSP, LLWAS, Weather Surveillance Radar 1988-Doppler (WSR-88D, commonly known as NEXRAD), adn the Lockheed Martin Coherent Technologies (LMCT) Doppler lidar and proposed x-band radar. [not complete]
READ LESS

Summary

An objective wind-shear detection probability estimation model is developed for radar, lidar, and sensor combinations. The model includes effects of system sensitivity, site-specific wind-shear, clutter, and terrain blockage characteristics, range-aliased obscuration statistics, antenna beam filling and attenuation, and signal processing differences which allow a sensor- and site-specific performance analysis of...

READ MORE

Amplitude spectroscopy of a solid-state artificial atom

Summary

The energy-level structure of a quantum system, which has a fundamental role in its behaviour, can be observed as discrete lines and features in absorption and emission spectra. Conventionally, spectra are measured using frequency spectroscopy, whereby the frequency of a harmonic electromagnetic driving field is tuned into resonance with a particular separation between energy levels. Although this technique has been successfully employed in a variety of physical systems, including natural and artificial atoms and molecules, its application is not universally straightforward and becomes extremely challenging for frequencies in the range of tens to hundreds of gigahertz. Here we introduce a complementary approach, amplitude spectroscopy, whereby a harmonic driving field sweeps an artificial atom through the avoided crossings between energy levels at a fixed frequency. Spectroscopic information is obtained from the amplitude dependence of the system's response, thereby overcoming many of the limitations of a broadband-frequency-based approach. The resulting 'spectroscopy diamonds', the regions in parameter space where transitions between specific pairs of levels can occur, exhibit interference patterns and population inversion that serve to distinguish the atom's spectrum. Amplitude spectroscopy provides a means of manipulating and characterizing systems over an extremely broad bandwidth, using only a single driving frequency that may be orders of magnitude smaller than the energy scales being probed.
READ LESS

Summary

The energy-level structure of a quantum system, which has a fundamental role in its behaviour, can be observed as discrete lines and features in absorption and emission spectra. Conventionally, spectra are measured using frequency spectroscopy, whereby the frequency of a harmonic electromagnetic driving field is tuned into resonance with a...

READ MORE

A 64 x 64-pixel CMOS test chip for the development of large-format ultra-high-speed snapshot imagers

Summary

A 64 x 64-pixel test circuit was designed and fabricated in 0.18- m CMOS technology for investigating high-speed imaging with large-format imagers. Several features are integrated into the circuit architecture to achieve fast exposure times with low-skew and jitter for simultaneous pixel snapshots. These features include an H-tree clock distribution with local and global repeaters, single-edge trigger propagation, local exposure control, and current-steering sampling circuits. To evaluate the circuit performance, test structures are periodically located throughout the 64 x 64-pixel device. Measured devices have exposure times that can be varied between 75 ps to 305 ps with skew times for all pixels less than +-3 ps and jitter that is less than +-1.2 ps rms. Other performance characteristics are a readout noise of approximately 115 e- rms and an upper dynamic range of 310,000 e-.
READ LESS

Summary

A 64 x 64-pixel test circuit was designed and fabricated in 0.18- m CMOS technology for investigating high-speed imaging with large-format imagers. Several features are integrated into the circuit architecture to achieve fast exposure times with low-skew and jitter for simultaneous pixel snapshots. These features include an H-tree clock distribution...

READ MORE

Convection diagnosis and nowcasting for oceanic aviation applications

Published in:
Proc. SPIE, Vol. 7088, Remote Sensing Applications for Aviation Weather Hazard Detection and Decision Support, 25 August 2008, 708808.

Summary

An oceanic convection diagnosis and nowcasting system is described whose domain of interest is the region between the southern continental United States and the northern extent of South America. In this system, geostationary satellite imagery are used to define the locations of deep convective clouds through the weighted combination of three independent algorithms. The resultant output, called the Convective Diagnosis Oceanic (CDO) product, is independently validated against space-borne radar and lightning products from the Tropical Rainfall Measuring Mission (TRMM) satellite to ascertain the ability of the CDO to discriminate hazardous convection. The CDO performed well in this preliminary investigation with some limitations noted. Short-term, 1-hr and 2-hr nowcasts of convection location are performed within the Convective Nowcasting Oceanic (CNO) system using a storm tracker. The CNO was found to have good statistical performance at extrapolating existing storm positions. Current work includes the development and implementation of additional atmospheric features for nowcasting convection initiation and to improve nowcasting of mature convection evolution.
READ LESS

Summary

An oceanic convection diagnosis and nowcasting system is described whose domain of interest is the region between the southern continental United States and the northern extent of South America. In this system, geostationary satellite imagery are used to define the locations of deep convective clouds through the weighted combination of...

READ MORE

Applications of a macroscopic model for en route sector capacity

Published in:
AIAA Guidance, Navigation and Control Conf. and Exhibit, 18-21 August 2008.

Summary

Airspace capacity estimates are important both for airspace design and for operational air traffic management. Considerable effort has gone into understanding the complexity factors that reduce sector capacity by increasing controller workload. Yet no analytical means is available for accurately estimating the maximum capacity of an en route sector. The Monitor Alert Parameter (MAP) values that determine the operational traffic limit of en route sectors in the United States account only for workload from inter-sector coordination tasks. We propose a more complete sector capacity model that also accounts for workload from conflict avoidance and recurring tasks. We use mean closing speeds and airspace separation standards to estimate aircraft conflict rates. We estimate the mean controller service times for all three task types by fitting the model against observed peak traffic counts for hundreds of en route airspace volumes in the Northeastern United States. This macroscopic approach provides numerical capacity predictions that closely bound peak observed traffic densities for those airspace volumes. This paper reviews recent efforts to improve the accuracy of the bound by replacing certain global parameters with measured data from individual sectors. It also compares the model capacity with MAP values for sectors in the New York Center. It concludes by illustrating the use of the model to predict the capacity benefits of proposed technological and operational improvements to the air traffic management system.
READ LESS

Summary

Airspace capacity estimates are important both for airspace design and for operational air traffic management. Considerable effort has gone into understanding the complexity factors that reduce sector capacity by increasing controller workload. Yet no analytical means is available for accurately estimating the maximum capacity of an en route sector. The...

READ MORE

Electro-optical system analysis for sense and avoid

Published in:
AIAA Guidance, Navigation, and Control Conf. and Exhibit, 19-21 August 2008.

Summary

This paper presents a parametric analysis of the sense and avoid capability for an electro- optical system on unmanned aircraft. Our sensor analysis is based on simulated encounters from a new U.S. airspace encounter model that provides a comprehensive distribution of typical visual flight rule (VFR) aircraft behavior and encounter geometries. We assess the exchange between the sensor field-of-view shape and detection range with the probability of intruder detection prior to near miss. This assessment also includes a trade-off analysis between field-of-view azimuth angle and probability of detection with fixed tracking technology (i.e. pixel array sensor and tracking algorithm). Initial results suggest that current standards are suitable for detecting larger aircraft but may not be ideal for small aircraft such as ultralights.
READ LESS

Summary

This paper presents a parametric analysis of the sense and avoid capability for an electro- optical system on unmanned aircraft. Our sensor analysis is based on simulated encounters from a new U.S. airspace encounter model that provides a comprehensive distribution of typical visual flight rule (VFR) aircraft behavior and encounter...

READ MORE