Publications
Speaker verification using support vector machines and high-level features
Summary
Summary
High-level characteristics such as word usage, pronunciation, phonotactics, prosody, etc., have seen a resurgence for automatic speaker recognition over the last several years. With the availability of many conversation sides per speaker in current corpora, high-level systems now have the amount of data needed to sufficiently characterize a speaker. Although...
pMATLAB parallel MATLAB library
Summary
Summary
MATLAB has emerged as one of the languages most commonly used by scientists and engineers for technical computing, with approximately one million users worldwide. The primary benefits of MATLAB are reduced code development time via high levels of abstractions (e.g. first class multi-dimensional arrays and thousands of built in functions)...
Back-illuminated three-dimensionally integrated CMOS image sensors for scientific applications
Summary
Summary
SOI-based active pixel image sensors have been built in both monolithic and vertically interconnected pixel technologies. The latter easily supports the inclusion of more complex pixel circuitry without compromising pixel fill factor. A wafer-scale back-illumination process is used to achieve 100% fill factor photodiodes. Results from 256 x 256 and...
Construction of a phonotactic dialect corpus using semiautomatic annotation
Summary
Summary
In this paper, we discuss rapid, semiautomatic annotation techniques of detailed phonological phenomena for large corpora. We describe the use of these techniques for the development of a corpus of American English dialects. The resulting annotations and corpora will support both large-scale linguistic dialect analysis and automatic dialect identification. We...
A comparison of speaker clustering and speech recognition techniques for air situational awareness
Summary
Summary
In this paper we compare speaker clustering and speech recognition techniques to the problem of understanding patterns of air traffic control communications. For a given radio transmission, our goal is to identify the talker and to whom he/she is speaking. This information, in combination with knowledge of the roles (i.e...
A new kernel for SVM MLLR based speaker recognition
Summary
Summary
Speaker recognition using support vector machines (SVMs) with features derived from generative models has been shown to perform well. Typically, a universal background model (UBM) is adapted to each utterance yielding a set of features that are used in an SVM. We consider the case where the UBM is a...
Improving phonotactic language recognition with acoustic adaptation
Summary
Summary
In recent evaluations of automatic language recognition systems, phonotactic approaches have proven highly effective. However, as most of these systems rely on underlying ASR techniques to derive a phonetic tokenization, these techniques are potentially susceptible to acoustic variability from non-language sources (i.e. gender, speaker, channel, etc.). In this paper we...
Variable projection and unfolding in compressed sensing
Summary
Summary
The performance of linear programming techniques that are applied in the signal identification and reconstruction process in compressed sensing (CS) is governed by both the number of measurements taken and the number of nonzero coefficients in the discrete basis used to represent the signal. To enhance the capabilities of CS...
Multifocal multiphoton microscopy (MMM) at a frame rate beyond 600 Hz
Summary
Summary
We introduce a multiphoton microscope for high-speed three-dimensional (3D) fluorescence imaging. The system combines parallel illumination by a multifocal multiphoton microscope (MMM) with parallel detection via a segmented high-sensitivity charge-couple device (CCD) camera. The instrument consists of a Ti-sapphire laser illuminating a microlens array that projects 36 foci onto the...
Analysis of ground surveillance assets to support Global Hawk airspace access at Beale Air Force Base
Summary
Summary
This study, performed from May 2006 to January 2007 by MIT Lincoln Laboratory, investigated the feasibility of providing ground-sensor-based traffic data directly to Global Hawk operators at Beale AFB. The system concept involves detecting and producing tracks for all cooperative (transponder-equipped) and non-cooperative aircraft from the surface to 18,000 ft...